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FIG. 1A 



The Small Island: 10848bp (SEQ ID NO: 1) 

TTTTCX3AAGGGGCGAGTTGCACAGGCCTAGGAAGGGGTGTTGCACCTTTTACTTGTC«G» 

CCCGOVTTGACGTGATGTCyUVTATGGAATAGCCCGCCrATTTAGGTGGGCTTTTCTGTTTTTCAC^ 

GGaSTAGCAATGTGCCCTCGACGTAATCAGCGATTACAGAGGGTACTGCGGTTTCGGCCTTGAAATTGCGGOT 

TGCCGGCGCTGGAATCCAGGOSCTCrCGGTAATTCCAGATTTTTCCTACAGGCCrAGCGAACTGCTGGGAGGCGC^ 

TCGTCCGGTAGCOSGCCCCTCAGTATCXXSTATCGCTAGGaSAGACTGCTTaMGCGGTAGATC 

TTGCGGTGGAAGCCGCGGAGGCCGTGGCGGCACGGTTCGATAGGTGTTGAGCCATCAGAGTGAACAGC^ 

TGTCAGGTGGCTGTGGTAGCGCTTCTTCCCACGCGGTTGCATCTCCCTGAATGCCTG 

»GGTTCTGTCCTAGCCTCCTGTGGCGGCGCTTCACCCATTCGAGCGAGAGGTTGCCCGCATGGCTTCGGTTTGGCAGATC 

GCGGGGAGCCGCTGTAATCCACCTTCCTGACGTTCTCCAGCGCTCCTTGGGCCAGGCTTCTGGCATGCAGCACGGCACGC^ 

GGTCGTAGTCGGCCTCGATGTAGCTCATGGTGGTCTGGATATTCGAGTGGTTGAGCAGGCACTTCGTGAGGTGGATGTTCCGCT 

CGGGTGCCTTCATCAAGTCGGTGGCCAGGGTGTGCCGGAAACGGTGCGGAGTCATCCGCACCCCAACCTTC^^ 

GGTACATGGCTTCGACCTGGTCGGAGTTCATCACCTTGCTCTTGTAGTGCGGTGAGAACCGGTTGACGTTGAA^^ 

»TCGGCGAATCCXX5CTCTATCGGCCTCCTGCAGGAGCCGCGATAGGTGAGGCACAAGCCCCTCC^ 

CTTTATGAGTTTTCTCTGTCTCGCCGCGGATGAGGATCAGTTGATTTTCCCAGTCGATGTCGCGCTTGCGGATGCAaU^CAG^^ 

CaVTTGAACCGGATGCCGGTGAAGTAGAAGACCTO^CGTGCAAAGCCAGAACCAGGCGGGAGTGATGCGTGCG 

TGCAGCGCTCGGCGCCGACCTGCATGTTGAGCCAATTGCGGGCGCGCAGGATGGCTTCGGCT 

TGGGGGGGATGACGGTGGTCTTTCTGAACGGGTTGACTTGGGAGTGTGTCACCAGCTCGTGCTCGATGGCATAGCCCCAGATC^ 

TTCGCaiGATGATTCGAATACGTGTTCCAGCTCCGCTTCGACAGGCCTTGTTCCAGGACCTTGCGCCGCCATCCCA 

TGtGGTCCACCTCCTGTACGGTTGCCGTAGGGCCGAAGTGCTTGAGCAGCGCCTTGGTCGCGGCGCGGTAGATCTTCGCGCTGG 

CTTCTCGGAGATCGTGCGCGAAGATGTACTCCTCGGTGAGCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGAC 

CGTGAGGCTTGGGTTGTCCT^GAGGCTGCACAGGGAACAGCAATTTGGGATCCTGGAGCAGGTAGGCCTTGAGCTCTTTCGTCTT 

GCGAGGACCAGAAACCTTGATGGTCCAGATAtTCAGGTTCTTACTGGTCTTTCGGTGAAGACCCTGTTTCTCGAACGCGCGCTC 

CACCAGCTTCCAGCCGGCGGTCTCCTTGGCCTGGGCCAGCTTTTa^CCTCCGGATGCTCCTGGACATAACGCTTGAAGATTCC 

TGGCGTGACCAGCATGGCGGTCCCGTCGACGGTATGCACCAAAGCCTTGGTGTaSTTGATGAACAGGCGACGG^ 

AGATTTCATCCAGCCAATGAATCCCTGTCCCAGATCAGTGCTTCTGCTAGGCATAAATACATCTTa^TT^ 

ATCTGTAGTGGTCAACCCATCCCGGACAGTGGGTTGAGTTTTTCTTCGGCCACCX3CAGGTGGTAACCCGTCGTTGAATTGATGC 

GGCCTGATCCAGTTGTAGCGGTGCATCAAGTAATGACTGATGTCCCGTTGGGCCTCCTGCGCCGTCAGGTAACCCGTTGACGG^ 

ACCCACTCCGACTTCAGACTGCGGAACAGGCGCTCCATCGGCGAGTTATCCCT^GCAATTCCCCCGACGGCTCATGCTCTG 

ATCCGATAGCGCCAGAGCCGTTGCCGAAACAGGCGGCTGGCGTACTGGCTGCCCTGGTCTGAATGGAACAGCACCTGCTGTGTC 

CTGCCGCGCTGTTCGTAGGCCATGTCCAGGGCCTTGATCACCAGTTCGGCATCCGGCTTGGCCGAGAACGCCCAGCCGATCAGC 

AGATCCAGCACCGCGGCCAGGTAGTGCCAACGGCCTTGCGCCCAGACGTAGGTGATGTCGCCACACCACACCT 

TCGGTCGCGAATTCGCGGTTCAGCCGATTCGGGATATCCGGCCGCTCAACCX3TGGCCTGTTTGTAGGCGTGCGAGCCCGGTTGC 

TTGCTGACCAGGCCCAGCTOVCGCATCAACCGACGCACTCGGAAACGGCCGATGGTCACGCCCTCT 

ATGCTGCGGCTGCCGGCCGAGCCCCGACTCTGGCTGAACAACTGGTTGACTTGGCTGCGTAGCGCCACGCGGCGAGC^ 

CGCCGCCGTCGAAGACGGTGGACGTAGTAGCAAGACCXSCGCavaVTCGAAAGCTGAGCAGACCACTTC 

CTCAACTGGTCTATCAGCGCGTACGATCGAGTTCGTCCGACATCAAGAGAGCGGTAGCCTTTTTTAATATCGCTTTCTCCCGCT 

Ca^TCGGTTGATCCGGGCTTCGAGCTCCTGGATCTTTTGCTGCTCAGGCGTCAAaS^ 

GCTCCGCCTCGAGCTGCTTCACCCAACGGCGCAAGGCCGAATC<^CCACCCCCAGCGAACGGC7^GGCGTCG^ 

CTTGGTCCAACACCAGGGCCGCGGCCTCTCGTTTGAACTO^CGGAAAACGTACGTCGTTGCTTGCTCATCAGACACCTCTTT^ 

ACGGCGAGGATTCTCGCCTAAATCGGTGTCCGGGATCAGTAGACCACTACACTGCCGAAACAGTTATTTTTGTTGCTAAGAGGC 

AGGAAGGCTGCGCTGA6ATTCCTACTAGTGAATTAGATAGTTGTTGTCTTCTGAAACAGAACTGAAGCGAAATTGGGGGTAGGG 

TTTTTCTAGGTGAAGGGTAACTCTTGAGAATTACAAAGGTGTGACGCAAATGTTGTATTTTTCTTGCAGTAT^ 

TTGGGTCGGATATAGGTACTTCTCtCTATTTTCTTTAATTGCTCTCATCTATGGGTGTGTCGGTGGTGGAGGTGGATCGGATGA 

GATTGGGCAGCACTGCTTTGAGAGAGAGCAAAAGCTTTCaSGAGTTAATGATA^ 

GAACTGCGATCCAATTGAAGGTCGTCTTCTTGAATCAGAGAAGCTGATAAGAAAGCCGCCCAATGAGCTGGGTATT^ 

TAATAATTTTTAGCCTGGAGAGGTCTTTTTGAAAATGAAAAAATCACTTGTTATGTCGGCTGTGCTtTTGGTGGCTAGC^ 

CGCGTGTGCTGATGAGGGCTCAAATGATGGAAGTGAGATATGTCGGGCGCAGGGTGGAGTTGAAATAACAAGTCTGC^ 

CTCAAAGGGTGTGGATGTTGAAGATGTTGTAGTTTGTTCGATTCTTCCAAGTAATATGAAGTCGAGTCMAGAGCGCCTACA^ 

CCCTCCTCTGCAAAGGATGATCATTTCGGCAATGCCTTCy^CCAGGAACGGTCACTGTTTCT 

AACATCTTGCCGGGCAAATCTTTATGCTCCACGTTATGCCAATTTCTATCCAGACGGTGTTAGCTVGGGGAACA^ 

ATGTGTTGGTTACAATACACCCGGGAATTCATCTCy^GGGTGTAATGTGTCATGGGACGGCCCGACCGACATTa^ 

TGAGCCATATGGCGGATCTGTTGTTGTAAACTACAGTTGCACTGCATTCAAAACAACGATTCCAGTGATAATGAGCTAC^ 

TCGTGATGGGCGGGCAGTGTATGGCGAGGTCCAGAATGTGTCAGGAATAATAAATGTGGTTTTGAACTAAGGATGCGCCTTAGT 

AGGATTTTATTTCAAGGTGTTCAGCTTGTGAATGTGCTGCAGGGCACTCTATAGAGTGCCCTGCATAGGCTGGAAGTCGTTTGG 

CTGAGGGCTAGTCTTTCGAGTTCGATATGGCACGTTGATGAAGGCTGGTAAAAAGTCCTGTCTGTGTGGGTGGCAGGCGAGTGG 

CGCGTTCAGACGCTAATGTGGAGTTCAAGGTAGGGTTCCGACGCCGAGGATCCTCTTTGGGTGGAGTCATTTGCTGAGCCATGC 

AGTTATAAGACAGGCTACGTGCTCCATTGACTTGGACATCGATACACGAAGATTCT^GATTGTTCCGAGTGTGACTCTGCGGGra 

GGGAGGCAGmGCCTGCTGTGCCAAACCAACTATAGGCAACT^GAATATAATTCG;^ 

TAACCCTGAGGGTTGGATGAGACTCAATAGTAACCTCAAGTTATGAGGAAAGATGAACCGCCTATCGTTTTGTAGTCTTGGTTA 

GGTCy^GGCACACGCTAGTGAGTAGTACTAAGAGTAAGCCGATAGCCAGGGGGCGTGGTGGCCCT^TTTGGGGAAGTC 

GTGCGGGCTTGTACCGGTTCGAGGAAGGAATAGACyVGCAGACAGGATCGCTTGCGATGGGGCAGCAG^^ 

CGTATCCAGAACTGCTGCTTGCAGCGTTAGGGGTGACTCCCTCATGCCCTAGAGGGCATAATCCCTGTATGGAGATGGCTM 

GATATTAAGGAGTGGTCGAAGTCATGGAACGCTTGCTCGAGAGCATTTACATCAATGCCCGGCCGGCGATG^ 

GCCTCACCAGCTCCXJGCCGCAAGAGAATGGTAAAGATTGTGGATGGGGAGGAGGTCGAGGTTCTGCCAGGTGAAGTGC^ 
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FIG. 1B 

tcctggaggcccaaaagagggatgttggaatcctcgccgacttcttagccaagagtctcgtggcgcgacgctagagatccttgg 

ctagccagtctatcgccgacagcaacgtggtctcgtcggccatgccttcaagcttcacgcttcgtcctgtctggttgtcccgga 

tgacgagggtcggggtcgctgtaatgcccttcgactgtgcggtgtcgatgttcgtaagcgtccggggaaggcacattgcgatgc 

gctaagctgtaatcctttaggattgaggtctcagcatggaatgccacgttcgtcccgccacgagcagagatgcagcagcgataa 

gctgcgtagttatagccgccctgcgtgagtcaaattcacaggactatccgcctgatgtgatcgctcaggttgagcagagctttt 

ctcctgaagccatcaccacacagcttacgaagcgtagggtcttcgtagccttattgggcgaaaacattattggc^ 

tcgacggtgacgtcgtcagaagtgttttcgttgacccagctcaccagaaaggcggtatcgggcggcatttgatggatgtcatt 

atacaactgctgccagcgcgggagttggagctgtacgtgtgccatcgtcgattacagctgaaaggttttataccgcattgggtt 

atcagaaaatccgcgacgagtttcatggggcggagcgcaccatcgttatggagaagcggctgtaggattattccagactattta 

cccacttatgcggcaacaaatcagtgatctcactggcccgctgcgtgggcagccgcgtctvgcacatctttgagatagg^ 

gatcatgcccattcatacgcgccgactagatcaggctcatgattgcagccgcccgtttacct^ctgcgcagcgacccggca;^ 

gccagttcgaacgcccgagcgcccatggccgtatttggttctcgacctgattgttatcgatgggcacagccccatcgtccaggt 

agcgcgtcagcgctacccagcgtttcaggctgtaatcgagggctttggccgtggctgatccattgggcaccaggtctcgctgag 

ccaacatccagtcatgcagttttttgaggatcggcaccgccaattcttgtcgtattcgccagcgctcttcatcactcatgtc 

gcgcctgacgttcgacctcgtacaagccgctgatcgagtgcagggcctgttcagccagctgacttttgttcgccacgtgcaaat 

CAAAGAACTTGCGCCGGGCGTGGGCCATGCAGCCGATTTCAGTGATGCCTTGTTCGAAACCGGCTTTGTAGCCAGCGAAGTCGT 

cgcagaccagcttgccgttccagtcacccaggaagttgcgcgcatgttcgccagcacggcttgggctgaagtcgtaaaccaccg 

CTTTGAGCCCTGAAAACGGCGTCGTGCTGTACGCCCAGACATAGGCCCGGT6GGTTTTCTTCTCGCCTGGCGCAAGCATTTGCA 

ccggtgtttcatcagcgtggatcacgccctggttcagcacggcttcacgcagtgcatcgaccagtggctgaagcto 

tttgtccgacccactgcgccagggtcgagcgagcaattgccagcccggcgcggcgaaagattttctcctgccggtacagcggca 

agtgatcggcaaacttggccaccatcacgtgggccaacaaacctgcggtcgggatacctttgtcgattgacctgccccc 

tcagtaccacgctgaacttagtagaatccgttttccaagcaggagacggc7vgtgcagaagcgttcttctcgacttcctcaagcg 

ggtcgaggctagtgtaccgaccgagctgcacgcccaactggtggaactggcccaggagtatcgacgctgtagctaacgccgcct 

gcacatcctgctgtggcgtgcaggtcaaccacaagcggatttaccggctgtaccgagccgccggcttgatggtgag^ 

GGGTCGCGGTGGCGCGCGAATGCCTGAGCCTGTCGAGCGCACCGAACCAGGTCTTGTCGATGGATTTCGTCTTCGAaSOT 

GCACTGGGCGACGGATCTVAATGCCTGACGGTGGTCGATGACTTCACCAAGGTGTCGGTCGACATCTTGGTGGAGTACGGTATC^ 

GCGGTTTTCGTGTCACGCGGGCGCTGGACGAGATGGCGCGGTTTCGTGGCTACCCGCy^GGCGATCCGCACCGACCAGGG 

AGTTCACCGGCAAGGCGCTTGATCT^GTGGGCCTGTCAGCGTGACATCAAGTTGAAGCTGATTa^GCCTGGCCAGCCC^ 

GCGCCTTCyVTCGAGTCATTCAACGGCAAGTTCCGGGGCGAATGCCTCAATGAGCACTGCTCGCTGGTCGAAGCt^ 

TCGCGGCTTGGCGGGATTACAACGAGCACCGACCACACAGCGCCATTGGCy^TCTCTCCCCGGCAGAGCTTGCTGCG^ 

gaaccaaccagcagcagctgaagcgggaaaagxtgatatcaaccccatagcctactaactaggcagcggt^ 

caggccagtccgaactgataaaaaaacgtccgaaactggcaaaaccttatggccggttttcctaatatttgctccx3aac 

gtatttcaaggagcgaacatgcatatccaatcgttgggggctactgcctcctcgctgaatcyvggagcctgtcg^ 

caggcagcgcataagtccgcctvgcttgcgtcaggaaccttcagggcaaggtctcggggttgccct;^ 

ctttccgggaagttgccggaaagcisttagcgacgtgcgtttcagcagtccccaagggcaaggggagtcccgtactctgact^ 

TCGGCAGGGCCGCGGCAGATCACTCTGCGCCAGTTTGAGAACGGAGTCACCGAGCTACAGCTCAGTCGGCCACC^ 

ctggtcctaagcggcggtggtgccaaaggtgcggo^tacccgggagcaatgctggcgctagaagagaaaggcatc 

atccgcagcatgtccggttcgtccgctggcggcatcaccgccgcccttttggcctcaggtatgagcccggcggcgttcaagacc 

ctttccgacaagatggatcttatttcgctgctcgacagctcgaact^gaagctgaagctgttca^cacattagm 

ggcgcatcgctgaaaaagggcttgggcaacy^gatcggcggcttctctgagttgctgctc^ 

cgggctgagcccctagaacx5cctattgcgcgacgagacacgcaaggccgtgctcggacagatcgctacg» 

cgccagccgaccgttgccgccatcgcovgcagattgcagtccggctccggagtcacctttggcgatctagatcggttgagtgct 

tacattccccagattaagacgctgaacatcacaggtacggccatgttcgaggggcgtccgcaattagtggtgttc^ 

cacacaccggatctggaggtcgcccaggcggcyvcatatctccggttccttcccaggagtgt^^ 

cy^gccgtaccyvggccggcgtagagtggaovgaattccaggatggcggggtgatgattaacgtgccggtcc 

aagaattttgacagcgggcctictgcggcgcaacgacaacctgatccttgagttcgagggcgaagctggc^^ 

cgaggtactaotggcggcgcgctcaagggctgggtcgtcggggtgcctgccctgcaggasascgaaat^ 

ctggaggaattgcgcgagcaaacosttgtggtgcasttgaagagcgagcgcggtgatttcagtggct^ 

aacttcaccatgccggacgagatcyuvggcgcatcttcaggagcgcctcavggagc^ 

caggcttcagagcgtcataccttcgcttctctcgacgaggcgctgctggcyvcttgatgacagtatc 

cagaacccggagatcamgacggggcggtggcttttcgccagaaggcgcxsggatg^ 

agcgccaatggcttggcxx^gtaggctcaagttggacgaggctatgcgctccgctcttcagcgactcgatgcgctggcagato 

ccggaacgcctagcatggttggcagctgagttgaaccatgctgataacx3ttgatcatcagcagttact 

gagacggtgcagtcgccxsgtgctcgccgctgosttagcagaggcgcagasccgcaaagtg^ 

AAGGAAGTTATCTTCCCCTCTCTGTATCGCCCTGGCCAGCOKSATTCCAACGTAGCTCTGTTACGTCGGGCGGAGGAGa 
CGGCATGCCACCAGTCCXXSCGGAAATCMTCAAGCGCrG;^ 

GGCAAACCCTTGAGTTOSACTACaSTTGAGATGGCTAAGGCTTGGCGGAATAAGGAGTTC^ 

AGTGGGGCTTGAGACTTCCCTCXSAGCAACGATGCCAaSTTGCGGCTGa^CCGGCAG^ 

TCGAGGGCXraTTGGCTTTTCGtCGTCGAGTTGGGACTTGTGCCTTOVGGGTTACCG^ 

TGAACTCTCCATTCTCATCCTTGGCACCGGTGAAACTTGCGGCGGACGATGCCGGTAGACTTGTGCTCTGGGCTGAGGC^ 
ATGGCGTTGACGATGTGGATGCACTGAACCGCTTGCACGATAGGCTGCGGGAAGGACATTCACGATTAGTG 

ccacgggtgagttggttccagctcagatacaaaccagcgcgttagtgttcgtttgaacgc^ 
taaaatccxx:a6ttggtaatttctctaatagatcggccacccgaaggtggcctatctgttcatgct 

AACACCATTTCTACCGAGCCCCATCCTGGTTCAACGAGCCGAGGCCCTCGAGGTGGGCTTTTCTGTTTCTGGAGTCCGGATATC 

TGATCCTVCAGCGCGCCAGAACGAGCGGCTGGTTTCGTGCTTTCGCCAGTTATATCGGCACTGGCCC^ 

GTCTGTTTCCAT 

N 
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FIG. 2A 

The Big Island: 84830bp (SEQ ID NO: 2) 

caccggcgatgcatgggatgccgtaggggtcgctgatccagccgaggccgccctggatggttttctggtcggcgctg^ 

ggttgttgttgctctggttgtcattcacctcctcggccggcgccgggaaggtgoscacggtcccgtcgttgaa^ 

ggctgoxsatctgcccacgcacgcaggagagtgtccagtcgcccgaggcggtcccgctggct^ 

gctcgatgccgttggcggtgaggttgtccx3ggccgatgaggattttgaacgggtaaggatcattgaccgtcccgt 

caosaccgatcagcgcaaacatggccaccgagcccatgagcgtcgagttctgcggcagcgtc^ 

ggttttccaggtc:agatcgctgttggccgacggggtgcagcccgt<:».tcx3atccgctccagcgcgttctgtcot 

cattgccgaaggaggtcgggaagctgaatccgctc^gttgggtggtggagccggcggca^gcggctggccattg^ 

cgcgggcgtcctggggctcgatccagacgatatccxsatgaagacgagcccgctccctgaaagtgctggccatccct 

cgccgaatcctacgggcaggtcggtgtcaccggcctcgggaatgtgggttaggttctcgagccgtttctgtacctgg 

tcgtctggctacggttttcctgctgttgctgmgttcctgggcggtgttgttgagcttctgctcgatgttctc^ 

gcaggcggccctgcaggttctcattggcggctttgagcgagtcattctcctggatcaccttgctgatct^ 

ggctttcokrcaasatggtgcgtagtgtgtcggcgggcgtgtcgccgtcgatgcccagggttgcract 

tcggggttgcggoxrcttgtgactgcgcactttctttcttgcccatcatgctcacgccgatcaggatg^^ 

ggacgaccagcaatttcaggaggggattaccggtcagcgccacgaasgcctcctttgggatcgatct 

aggagcgcgtcxxscaaggccgcggccgcgcgtcacaigatacacggtagtggtgtcggaggcgtcgccc^ 

gggtgctggaaggtcgcggcgacgaaattgcccatcaggtccctgggatccagggccaggtgctgggcgctggcgttctgcagc 

ttcaccgccgtgatgtagtagtcgtccagccgccaggcgcccaaggcggtagcastgatgggtaggctggggagcac^ 

aggtosagctgtcgcttgacgcgcacctgaccgacgccatccacaxsttccaccgtgcgm 

gcatagcgcxstcagaaccacggggacgggcgtttcgokrggcacggccttcggtgcttctgcgtgctcg^ 

gccgatggctgggcttcccgggactggccataatgcggatccactggctcgccggcgacgatcctgaccggctcgcgcgg^ 

tggtcggccgttgcttcggtggcggcgatatosatgagcatctgctcgccyvttggtcxscgtcctgtaggcgc^ 

ggaatoksctcgttggcgagcaggtagagtgcxxrcgccggtactctggacgcgcagcttgccctgc^ 

actcgcacgtttctgtcgacgaaaacaatgcgttcctggccgaccgtcaatggaatggccaacggaatgcgctcc^ 

atctccaccgcgtgggco^gtgtgggtagggcaagcattagcaagagasagcctgtcga 

gcttggtgggggcggccggctcctcm.gctcgatacgttgaggcgtgtcx3gagtagcagtcccactgcaggccga^ 

tttccgggtcgacgtcggcgcggataacgtgcaacgggtagcgggccagcgcccgcttgatcttctcgccggcgtaatactccg 

tgctgtccatgtccaagttgacggtccagtcattgatcgagtgctggatcacgcggccyvttgctctcgccaat 

ggatttccgaggtggtgcgctcgcgacccctgagctcgccggcgttacxacgaaactcaaagtctttctccagg 

aggagggagtgaggtaggcagratagcggaacaggtttcccttgtagtccacctcgccgtccttgggccy^ot 

ggaagatgtagaggccgaacgcatagacgcrctctggcggaatgtcccaccacaaccgggtgcttcctgagcgc^ 

gaatgtggatccyvgaggtccttacgggcttcccagacgcagtaggccagcacaacgatgaccatgc^ 

accggaaggtgttgatgtgtgcctgctgttgcgcagtgtgttttctgaaactcatggggactccgtgcgtcgactggtccaggc 

gcoksagcgcgtaaccaggcgacggttgccgaagacgggaaagcgcagcgagagggccy^tctccac 

ato^ggccgcccccxscttcatccgccgcaggacgcggctgccgatacctaggccgagggcgccgaccagm 

aatgcaggcagcgttacctgcgaccaaggcagccgggatgccaagaacgaacccggctgctccgctggtgaagaccgtgatcm 

catttcgtctgcggtcaggccgccgatgactaccggttgccggttcaaacgggtcggcaggaagctgagggttccatcctgaaa 

cyvgatgttcttcgggo^tgggaaggcccttacatgatggcggtggctttggtgacgagataaatgatcaag^ 

cacctacggctaotcccgctccgagatccgaccacttcttcttcccgtcatggatggcgtgataggtgccgtaggtatgccaa 

cgacccccagaaagacagccgcgcagatgagcagcgcgaggagavtcgctccgtcatagccgaagttctggatggtt^ 

tgcccgatccctccccacggctaggtgcctcgggtttggggagtgcagccaaggcgataccggggagcgccaaagtgatggcc^ 

ccaggctctggcagagggcggacagtttctggagggtgagtttcagcatgagtgtctccgtatgggtcagctgaggaggaagaa 

agaggtgatgccgaggaggaccaagatccgggatcgcggaagcx3ccaccggtggcgctgtcgaaggttgttggtggcc 

cgccyvggtgctgtacatcgcccaggcggaccacagcagaaggaaggtcatcgctgctca^tgaaca 

gggggaaagccagcggcggcctggaaotctgatgtctgggctccgctcatgctcatggcatcggccctccgcgca 

tccctgataagtcggaagggtcacgcggttgggcgcggctgggcgtcatgtagtcctgcaggccttggcggatgcgctgcaggt 

cagcggccaggcgcgggtagtcgaagtagaagcgctgtccgggctcatcggcgccctgggcactgcggcgggcg^ 

gggcgttgagctgccggatcatcacctcovggttcgcctgctcggaggcgctcgctgcgaaagctccctgtgcgaccagcg 

tggaggccaagacggccaggcagcttcggosaagggatggatgtggagto 

cagcttgccgatcgccctgattca^ttccgcaagaaatactcggcx3cxxk3tccggggtatot 

aaaacagcaggtagctgacgggcagagcgacgccccaagcccagccttgggcgttttgattaccgatctcggttacgccccgca 
tgagtggctgaacgctcaccagctcccagccttcctggcccagacgcacgacctcggcccagcgcagagtc 

cgacatggtattcgaagggctgttttatgctcatggtgtgctcctggatcggttcgagtagaaccgcacggtct 

aggagggcctct^ccgaaaatactaggcctagcaccagtgaatttctggtggttggcgggtttagagacacgtatggacgtctg 

ggtgtatatttcatatatatttctaagggagacggctgatgcttagaaacatctctattggagttttgctagccatggctgcta 

tgttgggcagttatggggtggctgccgctacattacgatgcgggtcggctuvttgttagtgagggcgacttg^ 

ttagaaagtgcggcaaccctgatagccgtaaaattgaagggcccgcagtggatggtagtggctatatagtc 

ctgtcgaaaactgggtatatggaccaaggaatggatggtaccagaagcttaggtttgtcgatggaagactagttca 

gcagtatggactagggtatagccgtggatggtgtgttttcatccacggctataagtctcatccggcagatgatataagggtaa^ 

gatatttgcgattggtaggccttgtgcgtcggaaataaacacggttgtactggcgcctgcacgggaaaactatctgttgtaggt 

tgttcx3gattaga»tgccaccgttgtaattggcttggaattgcttgctggaacttgctatgtctagaa 

TTTGGTTGGCGGATTGTGCGGAGTTCATGCTGATTGATGTGTTTGATCCTGTCGCTCGGAGCGCCTGCTGGATGTTGTTTGAAG 
TGAGTTGGGCACCATTAAAGTTGGOSCTGAATTTTTTAGTGTTGCCCGCGATGTTGATGGCCTGTAAAAGGGTGG^ 
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CCGCCTGAGCGGTGTTGACGCTGATATTGGTACCCGAGGCCGCAGCGCTGATCGCTTGCTGAAGGTTGTCTGCTGTTAGTTGTG 

CACCATTGAAGTCGGCGCTAAACTTTTTCGTGTCTTTGGCAGAGTTGATGAGGGCTAGTAGGGTGCTTATATTAACCGCCTGTG 

CGGTATTGACGCTAATGGATGTTCGGGTTCCTGCGGCGTCTAAAGCTTGTTGAATGTTGTTTGAAGTAAGTTGAGCGCCATTGA 

ACTCTGCGCTGAATGTTTTTGTGTCACCTGCTGCATGAATAGTTTGAAGAAGGGCGGTTATATTTGCCGCTTGTGCGGTATTGA 

CGCTTATGCTGGTGTTTGTTCCTGCCGCATTTACTGCTCTAAGTAGGTTGTCGCTGCTGAGTTGGGCTCCATTGAATTCTGCGC 

TAAAATTCTTTGAGTTGCCTGCTGTAGATATTGCGGAAAGAAGTGTGTTTATATTAATGGCTTGTGCGGCTATTGTGGATACTC 

TGGTTAGTGATCCTGCTGAGTTTATGACTTGCTGAAGGTTGTTAGO^GTCAGCTCTGCGCCGTTGAAAGTTGTTGAGAGGTTTT 

TTGTGTTTTCTGCAGTGGTTGTTGCCTTGAGAAGAGTTTCTATATTTAATGCTTGCGCGTTGCTTATGCTGATGCTTGTATTTT 

TTCCTGCTTTTGATAGGGCCTCTAGCATGTTGCTTGGATTTAGAAGGCGTCCGTCGAAATCAAGGATAAGTTTCT^TCrC^ 

TCCGCTGTGACCTAACGTTAAAGTCTTGGTGTGAGTACTTCCTAGGCTGAATGAGTTGGCTACAAATCTCTTTGCCTTATGTCA 

ATATCAGGCATTCGCTGTGACCAATATCATTTTTTGAGTCATTTTTTACAGCTAGCTATCTGCTCCTTCAGTTTTCTGTCGTCC 

ACGTACCGCTGCATGATCGACAGCACCAGGCTGTTGATGGCT^GGATCTTCATCGCGCGGGTCACTGCGACGTAGATCAGGT 

AACTCATCGTCGCGCTTGCCTGGGTCGGTGTCGGGGGCCAGCGGGTCCGCGTTGAAGTCGTCGTACAGGCAAACGAAATCCCAT 

TCCAGCCCCTTGGCCTTGTGTGCGGTGGTOVGGGTGATTGTTGCGTCCAGCTCATCGTCAAGGGTCAGTGAGCGAAGCTCAA 

ATCCGCGCAGGCAGATCAGGGTAGGTCGATATGATCTTGATCGAGCGAAGCATCTCACCXSTCCTGGCTGATCTCGGCGA^ 

ACGTACTGGGTGTAGTCGCGGTAGTCACGGAGCAGTTTCTTGTTCTGGACGTTTTGGCGCAGGCCTCGGCTGAAT^ 

TCCTCCAGGTCGCGCAGCGAGTAACTGTCGATACCGCCTACCCAGTGGAATTTGGGCTCCGGATGATTGCXK3ACCAG 

GCATTCrCGATGACGCCGATAACGGTGCGGTGAATGAAAGTGCGGTGAGGAAGGTCCGGCGGGAGGGACTTTTTCACCAGCGTC 

TGCGGACCCAGTCCTTGAAGTTTCCGTGTTTCGCCCTTGTAGGAGAGGATGATGTTGGCCACGTGT6CGATCGCGGGGCCGAAT 

CGCCAGCTCTGGGTCAGGTAGTGCTCCTOXSCXSCCGGCCATCmGTCGCTGTTCAGGGCATCTTCTGCGCCCCTGAA^ 

AGCTGCTGATGGGGATCGCCGACGATAGCCATTCTGATGCGCTGCCAATGGGCAATGTCCGCGATCACTGC^ 

CCCTCGTCCAGGAGCATGCAGTCGAAGCGCTGGCTCMATCGGGCTTGCTCAGTTGATACAGCTTCAGGTAGCOT 

ATCAGCATGCCXKSTGTCCTGGAGATCGACCATGCGCCTCCAGACTACTCGCGCCATCTCCAGGCCCTGCTTGAGG^ 

TGAGCACTGGTGAGGAACGCCTTGTCGCGGAAGCGCGGGAAGTGCGGTCGGCCGAGTTCCGCGTOKSCGCTGGC^ 

TTCAGCGTGGCCAGCAOSTCACGTACCyWVCTCCCAGTCTTGGGTAtCGAGTCCGCGGGCGATATCGGTC^ 

GTCTTCTTGTGGGCGTACTGAATGCCGTACACCGCATGAGCCAGACTGTGGGCGGTCTTGCACACTACGTTGCGGC^^ 

CCCTTCGCGGCTTTCTCCACCGAGCTGTTGTAGCAGAGATAGAGGATTCTCAGGGTAGGGTTCGACCTGGCAAAGCCC^ 

GTGGTAGTTTTGCCAGTGCCTGCGAAGGCTCGCACCAGGATCTTCGGTGCCTTCmCTGGAT^ 

CACTGCACGGCGGGTCTCCTACAGGTACTTTTTGAAGGTGGATGCTGTGATGCATACCGC^^ 

CAACAGGATGAGCAGGGGATTGATGCTGATGGGGATTGCCAGGTAGAGaSTCC^ 

GCGCGCCTTGTGGTAGAGGTAGCTGGACTCCCGGCCGGCGCCXSAACTTGCGCAGGTCCCGGCGCACaVGGCCGTCCACC^ 
GGTGAACGOSGCCATCACGAATAACGGGATCGTCATGACCAGGATCACCAGGaSCACGACGA^ 

GGCCAGGCCXSTAGTTCTGCAGCGTGGAGACACCGTGGGCGGTGAGATAGCGAACGTCCAGCGGGCTCCGTGGCCGC^ 

GATGGTAGTCATGTTGGTCyVTCCAGTCGACCATCCCGGTCTTCACGAACAACCAGTCATAGGCCAACTG^ 

GGTTCGACCTGGCTCCTGCACGACGACGCTGTGTAGCAGCCCCTGaSACAGCCAACTGAGTTCGTACTC^^ 

ATGCTTCCAGCCCGCGTCAGGCCAGAAGAAATACAGGCATACCCACTCCACGATGATTGCGCCGATCAGCGACCCGAAC^ 

GCCGATTCAGGAGCAGGAGTAGGGGGGCT^TCGAAAATGCCAAGACTGGTGCCGTGGGTTTTCTGGGGTAGCCGCGG 

AACAAAGATGTTCAAACGTTCCTATAGGAAACTGTGTTGTGAACTCCATAGAACTCCCCTTTTACACG 

GGAGTGTGTATCCACTAGTTCTCGGGGAGGCCGTCACTGTAGATTTTTTGGGGGGGTATCTCGGCATAGGGGGGATTGAATAGC 

GCTGTGCCTGmGTCCTGAGACCTCATTAGGAATCATCTGATAGAGGGAATTTTTCCATC^ 

GAAGAATTTCTTACAGCCTTTTGATAGCGGTTTCTCCACTCCGAGTGCTGCGCTCAAGCTGCTCraCATGCT^^ 

GATGTTGTGCGTGCTATGCAGCCTGATATTCT^GTGTGAGCATGGTTTTAAACCATCAGGTGTCCCTCAGTCGGC^ 

TGTGGCTATGTAOSAAGCGCAGCTTTATTTCGAGCAGCGCGAGGCGTTGCTCT^TCACTTGAGCGGCT^ 

CX3CGGGTAGAGCGCTCGTCMCGAAGCGCCGAACAATGTGAGCATCCTGCCGTT6AGTGAaX5AGGGC 

CGCTCXSCACGCTCGGTGATCTCCGGGAAAAGCGGCTGGCACTGATGTATCTGGTCGATACCGACAAAGG 

GCTTACCGCCGATGGTAGGCCCTCGGGAGCGATATCCAGOVCGATAACCAAAGAGGTGTACCGAGCCTTGCTGGCGA 

GGCGCCTGTTCACTGGGTGACTGACGGTGGTACCCCTCAACGGCTGTACCTTTTTGAATCCTTAGGC^ 

GTGGCTAGGCCTGGAGATTCTCGGCGAAGACCTCGATTCGATGTTGCGCCGGAATGATGCCGGAAACTACATGCTGCTGGATC^ 

Ga^TGGGCAGGTOSTACTaSCTACGGACGCAGAGGCGCTGGGGAGCGGTGCGTCGCGGACGCTTTTGCGTGGAGACGGCTTCGG 

TTTCATCGGTGCTGGCCCy^CTGCCGCAGCATATGGTGCTTTTCCAGCACGTGGGGTCTTCGAGCTGGGATCT^ 

CGGTATCGGTCX»CCTGTTGCTGGCTCTGTGGCTCCCTCTGTTACTTGCCTCTGCGTTGGCACTCGCAGTCX3G» 

TTGGCTGGTGCGGAGCATCGAGCGACGCTTGATAGAGCCCGCAAAGCGACGCCTTGAAGCATTGAAGGAGAGCGAAGC 

CCGTGCAGTTATCCAGGCCGCGCCCGTCGCGCTGTGCGTGCTGCGTCGTGCCGAC6CCGCAGTGGTCCTGGAAAATCCCCAGGC 

GCGCCAATGGCTGGGTGATAGCGAGGCGATTGCCCACGACGCGCCGAGATGGATTTCCCAGGCGTTra 

TTCTGGAGAAGAACTGGAAACOSAGGCyVGGGCTACATCTTCATCTCAATTACACGCCCACCCGCTATAACC^ 

GTTCTGCGCCTTCAGTGAAATCAGTGCACGCAAGCGGATGGAGGCGGAACTGGCTCGCGCAAAATCCCTGGCTC 

TGAAGCCAAGACGCTGTTTCTCGCCACCATGAGCCATGAAATCCGCACACCTCTGTACGGCATGCrTGGC^ 

TGGGCGTACCGAGCTGAGTCGGCAGCAGGCCGGTTACCTAAAGGCAATCCAGCATTCCTCGTCXSACCCTGCrGC^ 

CGATGTGCtTGACGTATCCAAGATAGAGGCCGGCCAACTGGACCTAGAGTGCGTGGAATTCTCCCCGCTGGAATTGACCGAAGA 

GGTCGTGCAGTCGTTCACOKSTGCCGCGCAGGCCAAGGGGCTGCAGTTGTATACCTGCCTCTCTGCGGAGCTGCCGCT 

GCGGGGGGCCGCGGCGTCGATCCGGCAGATTCTCAACAACCTGCTGAGCAACGCGGTGAAGTTCACCGACAATGGCTATC 

CGTCCACCTGAAGGCCAGCGTGGTCGATGCCGAATGTGTGATGCTGACCTGGCAGGTCAACXSATACCGGCATGGGGATC^ 

CGAGGATCAGCCGCGTCTGTTOSAACCGTTCTACCAGATACGCCGCTCCGAGCATCCGGTCGCAGGCACGGGCCTCGGCTTO 

GATCAGCCAGCGCCTGGCGCAGCTAATGAATGGCAGTCTGAAACTGGTCAGTGAGCTGGGGTTGGG»GCAGCTTTAGCCTC^ 

GCTTCCGCTTGAGCGGATCGCGATGCAGGCTGAGCCGCAGGACCTAGCCGGGTGCGCCGTCCAAGTGCTGGCGCCTGTCC^ 
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CCTAACGGAATGCCTGTGTGGCTGGATCTCCCGCTGGGGTGGAAGGGCCATGGTCGCGACGCCGAGGTCGCTGGACGAGGCGGA 
CGCGACCTCGCTGCTGGTCAAAGTGTTACTGCTGGAGGGGGCGCCGATGTTCGAAGCATGGCCAGGATGCCGGGTGGAGCTTTC 
CCCTCAGGGTGATATGGAGCCGCAGGCACAGGGCCGCGACTGGCTGCTCGGGCTCAACAACCT 

GGGCCTGGCCCATGGGCGTCTCGCTGATCCTTCGACGCCGCCGATACGGCTGGCTCCGTTGCGCAATCTAGGTCTCCGCGTCCT 

AGTGGTGGAGGATAACGCGATCAACCAGTTGATCTTGAGGGACCAGATGGAAGCGCTGGGCTGCAGCGTGGAGCTGCTCTTCGA 

TGGTCGCGAGGCGTTGCTGCACTGCCAGACGGCCTGCTTCGACGTGGTGCTCACCGATATCAACATGCCGAACATGAACGGATA 

CGAGCTAACCGCGGAGCTACGGCGCCAAGGGTTCCGGCAGCCGATCATCGGCGCGACGGTGAACGCCATGCGTGAGGA 

GCGCTGCATGTCCGCCGGGATGAACGATTGCCTGGTCAAACCGGTGGATCTGAATGCCCTTCAGAACTGCTTGATTAATATTCT 

CAAGGTGGATCGATGAGCTGGAAATCCTATCGGGTGCTGGTGGTCGAAGATCAGCCGTTTCAGCGCGAATACCTGCTO^ 

TTTCGCGAGCGCGGCGTGCAGTACCTGGTAGGTGCCGGCGACGGCGCGGAGGCGTTGCGCTGCCTGAAGCAGGACAGGTTCXSAC 

CTGATCCTCAGCGATCTGATGATGCCGGGCATGGATGGTATCCAAATGATCCTGC?W^CTGCCGTATCTCAAGCATCGTCC^^ 

CTGGCGCTGATGAGCTCCTCGTCGCAGCGGATGATGCTCAGTGCCAGCCGGGTCGCCCAGAGTCTCGGCTTGTCGGTAATCGAC 

CTGTTGCCCAAGCCGACTCTGCCCAAGGCCATCGGCCAACTTCTGGAACACCTGGAAAGATGCCTCAGGa^GAAGCT^ 

GAAACCGACGAGACTCCGCATGGGCGCACGGCGTTGCTGGATGCCCTGCATAACGAGCAACTGGTGACCTGGTTCCAGGC^ 

AAATCCCTCCACACCGGGCGCATAGTCGGCGCCGAGGCGTTGATACGCTGGAGCCACCCGCAGCATGGCCTGTTGCTGCCCA 

TGTTTCATGAGTGATGTCGACGCTACCGGTCTGCTVCGAGGCGTTGCTCTGGCGCGTGCTCGAACAGACCCTGAACGCCC^ 

TCGTGGCGCAGGGCGGGTTACGAGATTCOXSTTTCGGTGAATCTGCCGCCGCACCTGCTCGATAACCAGGAACT 

CTCTATGAGTACGTCGGCGCTCGCGGGGCTTGTACCAGCTCACTATGTTTCGAGTTGACCGAGAGCAGTGTCACAACTCTGTCA 

AGTAACTACTATGCAGGTGCCTGTCGCTTGCGCATGAAAGGGTTCX3GATTGGCCCAGGACGACTTTGGCCAG 

TTCTATAACCTGGTCACGACGCCTTTCACGGAGCTGAAGATCGACCGCTCCCTAGTCCAGGGATGCGTAGAGGAT^ 

AATGCAGCTGTCATCAGTTGTATTGAGTTGGGTCACCX5CCTGAATCTCGACGTGGTGGCCG 

CTGAATCTTCTTCGTCGTCTTGGCTGCGACCGGGCGCAGGGTTTCCTGATTTCTAAGGCAGTGTCTGCTCGTGAGTTC^ 

CAGTTAAGGGAGGACGGCCCCAGCCTCCTTGTTTAACGCAGTATCCCCATTATCGCGGA6TCGATCGCAGCCAACCA 

CGO^GAGTGTCGCTGAAGGAGGCCGCATCCCGTGAAGTCTGCTAGTGCCTTGGAGCACGAC^ 

CAACCCTCTCGCAGAGCCrrGAGCATCGGCTTGATCTGTGTGGTGGTGCTGACCGTATTGCTGTTCAGCATCT 

TGGGGAGATTGTTTCAGGAGGAGGAGGAO^AAGTCTCCTTCCACTTCACCCGTATGATGGATGTTATACC^^ 

TTCTTGGGCGCATC6CTCG7U\AAAGCGACAAGACCACCCAGAAGTACGACTATGACGTGGTGCCTTTO 

aUU^GGAAAACGGATTAGCGGTCTATGAGGGACGGGAGTTTTCCTTTGCTATGCCATTTCTACTGGCTACCAAGm 

GCGCCGATTCCTCGGGAGATCCGTTTTCGCTCGGTGTATTGCTCGCCAATTTCTACGGAAGCTTCTGGAGTGTTTCCGCCTATC 

CCGCGCCACAGTTACTGATCTTTGATCTTTCCGGCAGCACCCGCCTGGCAGTGCCGTCGATTCCCTCCACAGCGCAGCGT^ 

GGTTGAGCGGAAGCTATCCGATGATAGTCGAGCGCATTCTGGCGCGCTTGCGCACCCGGCCGGTGGGGGAC^ 

TCCATTGGATACGCGCTGATCGCTATCGCGACTCGGCGCTGGA6AT6TTGGGAGTCGCCCGGGTTGATCTGCCGGAAACACTCT 

GGTGGCAOSAayVGCCGAACCATCTGATCATCGCTGCGAGCCTGCTTGATCTCAGGC^^ 

AGCGCCCGGCATTCGATTCGTACT^GCCTGGTATCGCCGGATGGCGAGGTATTGCTCGGaSCGGCCCCTGCGACCGGCCTGAG 

ATGGCCTGAACCTCACCCGACAGGGGGTCGCCGTTCAACTGCTCAGCCAGCCTGAGAACGGTTGGCTCGCG^ 

ACTACGGCAATTTCTTTCGCCACTCCCGGTGGCTGGTGGCAGGTCrGCTGCTGACCCCGGCGCTGCTCCTGGCCGGTTGGCT 

GGATGCXSTTGGTACACCAGCAGCGTCGTCAACCCXSGTGCATCGGGCGCACCGGCAACTGGTGGAG^ 

CGCTGATAOVGACCGOSCCGGTGGCTCTGGTGGTGCTGACCCAGGATGACCAGC^ 

AGTGGCTGGGCGGGCCCACGGAGATCCTTGGGCTGACTTCO^CTGGAAGCTTTTCGATGCGCGTGGGCA 

TCTGTATCCAGGTOSGTGGGCXSCTATTTGCAGACCGCCTTCGCGGCGACCCGCTATGCCGGCACCG^ 

TCAACGACATCACGGTCCACTGCGAGGCGGAGACCGCGCTGTCCAATGCGAAGCGAGCAGCGGATG 

CCCTGTTCCTGGCCCGCATGAGCCATGAAATCaSTACTCCCCTGTACGGTGTCCTTGGCACCCTGGAGTTGC^ 

CCCTGAACGAGCGGCAACGCGCCTACCTACXSCACCATCCAGAGTTCGTCTGCGACGCTCATGCAAC^ 

ATGTCTCGAAGATCGAAGCGGGGCAGATGGCTCTGACCCTGGCCGCCTTCAATCCGCTGGACCTAGTGCGGGAAGTGCTTG^ 
ACTTTGCCGCCAGCGCCATGGCCAAGGACCTGCAGTTCTATGCCTGCATCGACACaS^ 

TGACGCGGATTCGCCAGGTGCTCAATAACTTGGTGAATAACGCGCTGAAGTTCACCGATATCGGACGGGTGGTCCTGCGCGTGA 
AGTTGCTCTCCCGCAATGATGGTCGAGCCCTGTTGCAGTGGCAGGTCGCCGACACCGGTATCGGTATCGCACACGAACAGC^ 
AGCGCTTGTTCGAGGCGtTCTACCAGGTTTCGGGAGCGCACCATGCCGGCGGCACGGGGCTAGGACTGTCGATCTGCTGGCAT 
TGGCGGAAATGATGGGCGGTCACCTGCGAATGGTCAGCGAGACAGGGCTCGGCAGCAGCTTCAGCCTC 

AGGACGT^CAGTCCGGGCTGGCTTGCCGGCCGGGGCTCTTGAAATCCGCTTGCGTCGATGTGCGCTOSCCCGTGCGGGAGCTAG 
CCGACAGOSTAGGGGCGTGGCTGAAAGCCTGGGGCTGa^GGTCAGCAGCGGCGAGGCGGCGCCCrC 

TGCTTCTGGAGCTGCTGCCGATGGCGGCCGGGCCTGCTTCrTCX3CCCTGGCCAGGCCCCCGGGTGCGCGCX;TCCATGGATG^ 

CTTGCCAGCCGGAGCTGCGTGAGGACGGCK5GCGTGTCGGCCTGCACAACCTGGCGGGAATCGGCCAGGCCCTGG 

TGGGTGGCGATATCCCCGAGCTUVATOCCGGCAAATGCCTGCXSCCCGCTCGGGGAGACTCG^ 

ACAACCCAGTCAACCAGGCGCTGCTTCGCGAGCAACTGGAAGAGCTGGGTTGTCGCGTGAGCCTTGCCGGC^ 

CCCTG(^GCTGTTa5ACAGTGGTCGCTTCGACCTCCTGCTCAGCGACGTCAACATGCCGAACATGACCGG^ 

AGGCGCTGCGCGAACGAGGCGAGATOCTGCCGATCATCGGOSTGACCGCCAACGCCCrGCGAGAAGAGGGCGAGCGCT 

CAGTGGGAATGAACAGTTGGCTGGTGAAGCCGATCACTCTGCATACCTTGCATGAACrGCT^ 

TCGTGCTTCCCGCGCAAGCGCGAGACCTCGGCCCGCCCGCGCT^GCTCGACGACGGTCTCTCACaSCAGGTGCCGGAACGC^ 

GCGCGCTTTTCCTTGAGACCATGGGCAAGGACCTGGAGGCCGCCCGGCAAGCGATTCGCCGCT^CGACCCGAAGGGGCTGm 

AGGACCTGa^TCGCATGGCCGGCTCCCTGGCGGTGATGCGTGCGOSAACGCTGGTGGTGATGTGTCAGGGCGCC^ 

TGCTGGAGTCGCGCCTTGAATGTTCCGCCGTGGAGATTGGCGAGGTGCTCGTTCATATCGAGCAGGCGCTGGAGTTTGTGAGAA 

AGACGGGCTGATGCXSTCCGGGGTCAATAGTTGGAATTAGAACACAAGAGAAGCCTATGAGTAAGCTC^ 

TGACCATCCGATCGTGCGTATGGGCGTATGCGACATGCTCGAGCGCGACGGTCGGTTCGAGGTGGTGGGCGAGGCCTCCACGCC 
CAGCGAACTGGTCGAGGTGTGCCGGCAGAGCGAGCCGCATATCGCCATTACCGACTAC7VGCATGCCCGGGGACGAGCGCTACGG 
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CGATGGACTGAAACTGATCGACTACCTGTTGCGCAACTTTCCTCGTACTAAGGTGCTCATCTTCACCATGGTCGGCAACCGCCT 

GATCCTCGACAGCCTCTACGATCACGGGGTGTCCGGCGTGGTGCTGAAGAGCGGCGAACTCGACGAGCTGCTCTTGGCGCTCGA 

CGTGGTGAAGCAGAACCGCGTCTACCGGGGCGCGAACATGCTCGACCCGACCAGTGTTCTGGCGAACCGCGACGAAGTGGAAAG 

CCGCTTCGCGCGCTTGTCGATGAAGGAGTTCGAAGTACTCCGTCACTTCGTTTCCGGCAGCAACGTCTGCGATATCGCACGGCT 

GCTGAAACGTAGCGTCAAGACCGTAAGCACGCAGAAGGTCTCGGCGATGCGCAAGCTGGAAGTGAACAGCGACCAGGCCTTC^ 

GACCTTCTGCGTGCATGCCAACTTGTTCCATTGACAAGTTTCCACCGCGGAGCGTTCGAGACGCGCCGGCGGATGTCTCAGTGA 

AGCAGGCTCCGTCCCTGCTGGAGATTGCCGCCGTCATCGATCCAGCTGAAGCGCACCTCGGCCTGGCCGCCGGGCAGCGCCTGG 

CGGAGCGCCAGGACGCGCCTGGAGAAGGGGGGCAACAGTTCGGCCGCCGCCAGGCGGGTGTGGCCGCCA^ 

AGGCTGATGAAATAGGGAGTAGGGTTCTCCATCTGTAGTGTTTCGCCCCGCCGCTCGAACCTGAGCTGGTGTGGCGCTTCTTCC 

GCACCTCTGGCCAGAGGGCGAGGGCGGTAGATGACTTTCATCTGGGTGCGCAGTGTAACGGTCAGCCGTGACTGTCCTTCGGCC 

AGCAGCCCGGTGGCTTGCGGCGGGATTTCGTAGAGGTTCAACCAGTACAGCGATTCGCGGTCCCCTGGCAGCGCCTGGCCGGTC 

CGCAGCAGGCGCAGGCTGCGTTGCTGTCCGGGTTCGAGGCGGAACACCGGCGGTAGCGGCATGATCGGCGCGCGGGCAGAC^ 

GGTGTCGAGTTCGGGGCGCCATCGTCGATCCAGGTCTGCACTATGACCGGATAGGCATTCTGGTTGACCAGCAGCAGCGACGTC 

TCGCGGTGCCCCTCGGCGATTATTGCCCGAGTGCGCTCGGCGGTGACTCCGGCTCGGGCGTCCAGCACGGCCAGCATTGCTAGG 

CCGAGCAGTGCCCCTTGCAGCAGGATCTTACTGGACACGAATCACTACCTGCGCGCTGGCCTCCACCCGTCCCGGGGTAACGCT 

GGGCATGCTGCCAACGGTGAGTTTTTCCAGGCGCGCACGGAAGGTCTCGCTGTACTGGCTGATGCCTCCCGCTTCGCCAGTCTT 

GTTCGAGGCGTTGCCGATCACTGGATACCAGCCCCGTGTTTCGGCATTGCTGCCCATCGCGGAATCCTCGCTTACCAGCAGGTT 

CATCGCACTGCCGTCGCGCAGCAAGCGGATACCCACGCCTTGGGCCATACCAGGCTGGCCGTAGCGGTCGGAAACTAGGTAGGA 

GAGCCCGCCATTGGCGTTCCTCAGGCCCAGTGCAGCGGACGCGGCCTGCGCCCCGGCTGAAGCCCTGATACCCAGTGCAGTACC 

GCTGGAGGCGGTGCTGTTGATCACTCCCGTTTGGCACTTGAAGGCCACCTCGAAGGGCATCTCACGGTTCGCGCCGGCATTAA^ 

CTCACTGAGGGAA?^TCGACGGGAACACAACGTGGGGCGTGACGTTGGTTACGGAACAGGTGGGATAGCGCTTGAGCGTC^ 

GTTGTACAGGCCGATGGCGCCCGGCCAGTTGTAGTGCCAGCCGAGGTAGTTGGCGTTATGGCTGGCGCCGACGTTGGGGTAGGC 

CAGGCCGGGACCCTTGATGGCGATGTAGCCAGCGGGCTGGGTGTAGGCGTACAACCGCGAGGCGGTAGTCGGCX5AGTAGTAGCG 

GATGAACTCCAGCGGAGCGCGGAACAGTTCGGCGCGTACCGCGCTGAGGTTCTTCGCCTTGACCAGTTGAAAGCCTC6CGAGTC 

GATATCGAGCCCGCCGAGCAGACGCTCGCGCCAGACATCGGTGAAATACTGCCCGGTTTCCACGTGGGTTAGGCGCAGCAGCAC 

GTTGGGCCAGGCGGTGCGATAGGCGGACTGCAGGCCAATCGAGTTGCCCGCACTGTCGCCTCCTAGGTACCAGCCGCTGTAC^ 

ATCGTCGGCATTGGTGGAGAACATCTCGTACACATCGTCCTCTGGAGCACAGCGGAAGAACACGCGCTCTGGGTCGTAGCCTC 

GCTGTCGCCGTATTGGCTGAGCGGCGCCACCGAGCTGGCGATCAAGCTACCGTCGGGCTGGAAGCTGGCGTCGCTGACX5TTGAT 

CACGCTCGGTAGACCGAGGGAACCGTTGCAGGTATCGCAGGCTCCGGCCCAGGTGCCGGCAGAGCCTTCACCGGGACGTATCTG 

GGTGTTGGAGGTAGTCGTGGCGTTGCCTACCGCCGTGACCTTGTAGCAGGTCGCCCAGCTAGACGAGGCGAGCAGCGa^ 

CAGAGGCAGGACGCGCAGGGATGTTTTCATGGTGCCTCCGAGACGGGCGTGCAGGTGCCCTGCAGGaSGATC^ 

GGTTCGATAGCCTGGCGAGGGCCAGTGTOSAGGTCGTAATCCAGCAGACAGGCGTCCCCTGGGCGTTCGCTCC^ 

AGGCGGCCCTTGTCGCCCTCGGCGCGGGCATAGACCTGGCCGCCTTGACCGACCATGCCGATGTTCAO^CCCTGGCTO 

ACATTGGCCCCTAGCGGTAGOSCGCTGCCGTCGGCGAGTTGGGCCTGGATTAGCAATGGGTGACCGGTCAGTGTGCG^ 

ACATGCAOKSCGGCGCCGGCGTATGGCGCGATCTTGCGCTCGGTCTCaVGaVGCTCGGCCTCTTCGCCC^^ 

AGGCTGACaSGGTTGTAGOKJTAGGGCGAGAGTGATGGCACCACGGCGTAGCCATTGCCGTTGACGCGC^^ 

CGTACGCCAGCTCCGCTCGCGCCCTTGGCCTCGACCAGGGCGAAAGTGTCGCCGACGTAGGGGCCGAGGGTCA 

TGCAGCACGGCGGCGCCGCGCAGGCCGGTGTTGAGCTGCCGGTAGCTGCTGTTCTCGGCATAGCCGGCGTTCAGCGTCGCCA^ 

GAGGTCTGTTTCTGCAGACTGCCGTTGA/^TCGCTGCCGTGGTTGTCGCTGTCGCGCCCGGCGGCAATCGCGTAGCTCAGGCTG 

CGGTCTTCGTCGAGGGTGCCGTTGAGGCCCGTCTGGTAGCTGCTGCCGCGGCTGTCGCCGGAACGCCGGGAGGCCATC6CGCTG 

AGATTCGGGGCCTGGCTGGAGGACCCa^GCGGCATGGAAAGTGTCAGGGTTAAGGTGTTGCTACGTTCGCTGCCGTGCCGCAGG 

TTGTATTGCGGCGGCAGTGAOXSGTCGTAGTCGTCGTTCAGATCGCGGTACCAGGTGGTCTGCTGGCGCGAATAGGCGAGGTTG 

TAGCTGAGCTGGCGCCAAGTGTTGCTGTAGCCGAACTGCAACTGGGTGTCGCGGCTCTTGCCGTCGTAGTAGTCGCTGGTGGCT 

CCGGACAGATACAGGTTGCCGTAGCCCCCCAGGCCCTGGTTGACCAGCAGGGTGAACTGGTTGCGTTGCTTGTAGCTGCTC^ 

TTCCAGGAGTCGTTGTGCTCATCGGCGTGGCGCGCCGAAAGCGCGTCGCCGAGGTCGCGATAGCCCTCGGTGGAATAGCGGTAG 

CCAGCCAGGGTGAGAGTGGTCTGGGTCGGCTGGAAGGTCCGGCTGTAGTTCAGACCGACACGCCAGCCCTGCTTGCGCTGGCCG 

TTCTCCACCGTGGCATGGGAAAAGATGCTGTTGAAGCCGAAGGCTCCGTAGGGCGTGGCGAGCACGCCTCCGCCGAGCAGCGCC 

AGATAGTCCTCGGCCAGGCGOSAGCCGAGGTTGGCGGTTAGCGAGTTGGTCAGGCCGCGCTGATAGGTGAAGTCGCCGAACAGG 

TCGTTGCCGTCGCCATACTGGCGCGCTTGGCCGAGGGTGGCGCTGTAGCGCGAGATGCCCGGGCGCATGGATTCGGGAACCGCG 

GAAAAGGGCACGGAAAAGCGCGAGACGCTGCCGTCGGCCTCAATCy^CTTGGACATCCAGATCGCCGTCGTAGGCGGTGCC^^ 

AGGTCGTTGATGACGAAGGGACCGGGAGCGACGGCGGATTCGTGGACCTTCTTGCCGTTCTGGCTGATGACCACCCGTGCGTTG 

CTGTTCGCTGTACCGCGTACCTGTGGAGCATAGCGGCGTTGCGAGTCTGC»GCATGCGGTCGTCGCT6GCCy^GGCGCACTC 

CGATAACCCy^GGCTGCCGAACAGATTGCCCTCGGTGAAGCTCTCGCCCAGGGTCAGTTCGCTGCGCAGGCCTGGCA 

TGGGCATAGGTGCGGATGCTGTTCCAGTCGCTGCGGGTGTTTCCCGCATAGCTGGAGTAGCTGTAGTTGGACTGGTGGCGCAGG 

CGCCACAGGCCGAAATTGATGCCCCCGCTCAGCCCCAGGTAGCCATAGTCCGAGTCGCCGTTGCCGCCTACTCCGTCGAAGCTC 

GAGCGATAGAAGCTGGCGTCGTAGTTGACGAAGCCCATGCTACTGCCAGCGTCCCACTCGTCGGGACCCACGTAGCra 

TTGATATCCAGCAGGGCCTGGGGCACCGAGAGATCGAGGCGCAGAAGGGCGGTGTCGAGATTGAAGGTCGAGCCTGGCAGGC 

TCTTCCAGCCCCAGGCAATCTCCAGGCGCTTGCACGCCGGCCTCAGAGGCGGGGCGGACGCCCAGCGTCCGGCG^^ 

TCGCCGAAGCAGGGTTTGACGCCGCCGGCCCGGGGGCGGAACTCGATCCTGGTGCGACTGGCGTAGCT6CCGTTGAGATAGACA 

TCGACATGGTAGGTTCCGGGGTCCACCTGGTTGGCCCGGTTGAAGCGTTCTAGGGTCCCGCCGGCGAGGCCCGAGCCCATCAAC 

AGGCTGTCGTCGAAACGATACTGCGCCTCTCCATTGGCCAGGCCCGCGAAGGCCAGCGCTAGGCCGCCCAGAGCCAGCGGCAGT 

TTGCCGGTGCGCCGTGCCTCAACGTGGCAGAACATGCTGGACCTCCATTCGCGCGCCGTAGTCATTGATCAAGAGGG 

CTGGACCGGTCCTGGGGCGAGCGGCGAAGGGCGTTCCGCCGCCCACTCGGCCTGGCCTTTGGGCGGGACCATGCTCGACAGCAG 

CCGCCAGCGACGTTCGCCGATGCTCAGCATCGCGCCGGCGAAGGAGGCGTAGTAGCCGCTGGGGTTGTCCACCCGCACACGCCA 

GTTGCCGCTCGCCTGCACCAGGCTGAAGTGCAACTGCTCGACTAGCTGCTCCGAGCCGCCAAGCAGCGCGGCGGGGCGATAGAA 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page _6_ of 118 Customer No.: 21559 



7/118 

FIG. 2E 

CAGCTTCAGTCGACTGCGCTGCATCAGCAGCATCTGGTTGCGCTCGGCCTTGGCCAGGTCGCGGGGCGGGATCTGCAGCA^ 

GAGATGGAACAACGATTCGCGATCCTCGGGCAGCGGGAGCCCGGTATACAGGAGGCGCAGGGTCTGGCCGCT^ 

GCGCGTCACCGCTGGGCTGACCAGAAAGGGCGCGTTTGCAGTCTCCGGGGTGGAGGAGGGGTCGTCGTTGTCGATCCA 

GATCACGTTGGGGAATGCGTCCTGATTGATCATCTGCACGGTCTTTTCCCGGGCGTCGCCGGGATAAATCACGCGAGTACCGAT 

GATCACCACGCTGCTTTGAGCCGGGATGGCGGGTATCAGCAGGGCGAGCGCCAGCGTAGCCGCACGGAGCGGA 

AGTATTCATCTGTTCACCTTGGCGCCGTGGCGCTCAGGAGGGGCCTCGGGTACTGCGGCGGAACCTGGGAGGGCGCCGCTGGGC 

GGCGCCTCCGCTATTACTCGTAACGCAGGGTGTAGGTCACCGAGCCGAGCACCGGTCCCGGAGTGACGGTGGTCGCTTCGGAGA 

TGTAGCGGACGGCATAGTCGTAGCTGGTGCTGGTCTCGCCTCCTGCGAGCACGATGTCACCGGCAGCCACGGCCGACCCGCCGG 

CCAGATTGACCTCGCTGCCGCCCGGTGCCGTCAGTAGCTGCAGCGCTACGCCGGTGGCGCCGGCGGACGCGGTATTGGTCAGAT 

TGCCGGCGCTGGTCGGATTGACCGCCTGGAACATCACGCTGAAGTGCTCCTCGGCATCGGCCGCCGGAGCGGCGCAACCGGTCA 

GTTGCAGGGTGAAGCTGGTTTCCCCGGCGGGTTTGCCGACTGCGCCGTCAAGAGCGCTTACCGGTACGGTGTCGAGTATCACGG 

TCGGGTCGGTTCGTCCGTCGACGACGGCGGAACAGGTCTGGTCGGTCACTTCTCCGTGGAAGGTGATGGTATTGGCAGCCGCCG 

CGATACCCGGCAATGCGAGAAATGCGGCGATAGTCAGGGCAGTACTTTGAGGTTTCATTTCAGGGTTCTCCTTGTTGAGTTTTT 

TCCTCCGAGCAATGGTCTGTTAAGAGCTATCCGGGCGTATTTAGGACGTTTCTTAAAGGACCGAAAGTGGGTTTCTCGCTCTCG 

GCAAATGTTTTTACTGCAATGGACTACCCCTGCTCCAATAGACATTTCCGGGCCTCATAGCGAGGCGCTTTCAAAGGCCTCG 

ACTGACGCCACCCAGATGACTGTGCCGCCGGGTTCGGGTGTAGAAGACCTCGATGTAGTCAAAAACATCCGCCCGGGCCATG 

TCGGGTTTTGTAGATGCGTTTGCGGATACGCTCTTTCTTCAAACTACTGAAGAAGGATTCCGCCACGGCATATCCCAACAATTG 

CCGCGCCGGCTCATGCTGGGCTCCAGGTTGTTGGCAGCACAGAAGCGCTTCCAGTCGTCACTACCGTGCGGACTCCTATAGGTA 

CTTTTTGAAGGTGGATGCTGTGATGCACACCGCTACGCCGAGCAGTGCGGCGCAGGGCAACAGGATGAGCAGGGGATTl^ 

GATGGGAATTGCCAGGTAGAGCGTCCAAGGGACGACCGCTAGCGGAATGATGCTGCCGCGCGCCTTGTGGTAGAGGTAGCTGGA 

CTCCCGGCCGGCGCCGAACTTGCGCAGGTCCCGGCGCACCAGGCCGTCCACCTVGGCCGGTGAACGCGGCCATCACGAATAACG^ 

GATCGTCATGACCAGGATCACCAGGCGCACGACGAATGTCAGCACCGTGTACAGCGCGGCCAGGCCGTAGTTCTGCAGCGTGGA 

GACACCCTGGGCGGTGAGATAGCGAACGTCCAGCGGGCTCCGTGGCCCGGCCTGCGCGATGGTAGTCATGTTGGTCATCCAGTC 

GACCATCCCGGTCTTCACGAAO^CCAGTCATAGGCCAACTGGGCCAGCCAGGTGGCGGTTCGACCTGGCTCCTGCA^ 

GCTGTGTAGCAGCCCCTGCGACAGCCAACTGAGTTCGTACTCAAACATGGCCT6GGCATGCTTCCAGCCCGCGTCAGGCCAGAA 

GAAATACAGGCAAACCCACTCCACGATGATTGCGCCGATCAGCGACCCGAACATCACGCCGAGAAATTGGA^ 

GATGGTGCCGAGAAGGGTCTTCTGGCTCTCTTGTTGCTGCTCTGCTCGTTGAGTGACTTCAGCCATCAGGCGGCATCCTCACCG 

GTGCCGAGGTGACGGAAATCGTCGACCAGGTCCTGGGGCAGACCATCCTGGAGGGCGGAGTATCCCGGCGCCTCCCACCACTCG 

CTGTTGGCGGCCTGGCCCTTGCGCATACCGGCAGCCAGCTCCTGCAGGCTTTTCGGCATCACCTCGTCGGGGGCGACCGCCGGC 

AGCGGCATTCGGATCTTCCAGAGATTGCCGCCCTCGAGTAGCGCGAACGCTTGTCCTTTGGGCAGACCAATGATGTGGGCCGGC 

TCGAGCATCGGCACGCTGGTCATCTGCACCTGGTCGTGCGAGCTGGAGGTGAAGGCTACCTTCTTGTTGTTGTTGATCGCGTCG 

TTGGCGCCGCTCGCCGGCGTGCTGGTGTAGATCTGGACCTTGGGGAGCTGATTGGTAAGGAGTTCGGCCGTGGCGGTCTCGCGC 

ACCCGCAGCATGAACAGGTTGTTGAAGTTGCCGATGATCTGACCGGCCTTCGCGCGGGAGCCGATCTTGGCCTCGATGTCGCTC 

ATGGTCTGGGTGTAGGCCGTCACCTGCACGCCGGCGCCGCCCGCTTTGTTGACCATGGGGATGAACTCGTCGCCAATCAGCTCG 

TTGAACTCGTCGGCATGCAGGTTGATGCGGACCTTGCCGCTGGCGAGCGAGCCGGGCAGGCCGTCATCGACACCATGCTTGTAG 

ATGTGACCCGCTACCGAGACCAGGTCGCTGAACATGGAGTTGCCCACCGCGGCGGCGACCTCGGTATCCGATAGTGCGTCGAGG 

CCGACGTAGACCACGGCGCGTTTGCGGATGACCT6CATCCAGTCGAAGATCGGCCGCGGATCGTTGAGGTCCGCGTAGTTGGGC 

GAAAGAAGCTCCGAGATCCGCCCGGTAGTGAGTTTCTCCAGTAGCGGCAGCAGCGAGGCCACGATCTTGTCGAAGTAGGTCTTG 

TCGTAGCGCACGGCGCTCTTCAAGCCTTCCATGACCGGGTCGGCGATGCGTTTCTGTGTCAGGTACTGGTCGATGGCTACGACC 

CGCAGGGGCCGTCCTTTCATGTTGAACGGGATGTTCTTGTCGTTGAGCTTGCCCTCGATCTGGATGATGGTGTCCCAGGCCCTG 

GGATCGTGCTCGCTGATGTATTTCTGCGCATATTCGACGAACAACGCATCGATGTTCACGACGTGCCGGAGGATCTGCTGGTAG 

TCAGGCCGGATACCCAGCGCGTGCAGCGCGCGGGCGATGATGTTGACGAACCGCCAGGCGAACTCGCGGAACGCCGCGCTGTTG 

CCCTCGCCGGAGAGCTGGCCGGCGACGCGGGTGGCGACCTCGGAGATCCGACCGAACCGGCCGACGGCGTTGTAGCGTGCCGAC 

AGGTCAGGATGACCGAGGTGGAACACGTAGAACTCGTCCAGGCGGCCGGCACGTTCGCATTCCACGTACATACGCTTC^ 

TCAGCGTCGCCTTTCGGGTCGAAGACGATCACCACCTCGTAGTCCGGCTGCTCCTCTGCGCGCCGGCGCCGATAGCCGTGGTGA 

ACCGTCTGGGTCCGCCGGCCCATCTTCACCCGCCGGCGTCGTACCCGGCAGTGAGTGCGGCGAATATCCTGGGTGATGAACAGC 

TCCGCGAGGCGCGTCTTACCCACCCGCGTGGTGCCGAGTACCAGTGTGTGGCCGACGCGCTCGCCCAGTTGCAGGCCTACGTCC 

TGTTCGCGCGGCTCGATGCCATGGAGCCGAGGCAAACCGCCCACGGGCGGCAGCGGCCGTGCGGGGTTCCAGGCCACGTCC^ 

GCAGTGGCTTTGGCGACCAGCTTCAGGGGGAAGGGGGCGAACTCGAGCTGCTTCTCCAACCGGCGCGCGCGCTCGTAGAGGGGC 

GAGGGCTCGACGTAAGAGGCGAACTGGGGCAGGTAGGTATCTGCCAGGCGCTGCGTATGCTTCTGCGTCCAGCGAAATCCTTTA 

CCGATGAACAGGTGTTCGTTGCTGACCGGCATCTCGGCGCTGGTCATCGTGTACTTCGGCAGTCGGCGAATGTTCCGCCGGTAG 

CGGAGCACCACGCCGGCCTGCTTCAGCCGCACGATACCCAGCCAGGCGAAGCACAGCGCGGCCACGATGCCG 

AGGGAGAACGTCCACGGCGCGACGATGCAGAGCGCGGCTGCGGTGAAGCACACGGTGGTGGTGTAGAGCTCCACG^ 

AAGAGCGCTTCCAACGGGTACTGGCCAGCCATGGCTTACACCTTCAGGCTTGCCGGCGCCTGCTVGGCGGCCCTTGGC^ 

TCGAGCACCTGTTTCCTGGCTTGCTCTACCTCATGATCAGGCAGCATCTGCACCAGCAGCTCGCAGAGGGTCGACTCGTCCTCC 

AAGGCCTTGGACGTCACCTCACCGCCGTAGCGTGAGATCAGTCCGAGCAACTCCTGGTACATGAGTTGGCGTGCCTGAGCGTCG 

CTCGGCTGCTTGATGACTAGGGTGATOVGATGAGTAGTCATGAGGACTCCTACTGCTGGATGGCGGTGGATGTGATGAGGTC 

GGTAATGCTGCAGCCCTAGCCGGTCGACCAGGTCGTCCGCCGGCGCCGGCAATATCTGAAGTCCTTTCCCCCAGGCCCGGATTT 

CCGTCAGGCGCGCTTCGCTGGCCACGTTCACTGCCAGGCCCACGGCTTGGAGGTCTCGAAGCTCGTCGCCTCGCTCTTTCAGCC 

AGGTTCGAGACT^GCGTGTCGTCGCCGACCAGGAACAGCAGTTGCAGGCCTGGAGCGTTGATGGCGCGCCCCTGGACCCGTCCTG 

GCGATAGGCGGGCGGAGCGAACTGGAAACGCACCTGAGCCACGGACGCCCAGGTCGCCAGGTCCTGGTGCGGAAGCGGATGGCT 

GCGGATCCAGGCCCTGGTAGTAGGGGAGCGCCGAGGCGCCGCCGAGGTCTTCGACAACGATCAGCCCGGCGGCCCAGGCTTGAG 

TCGCGAACGCCGTGCAAAATGCCAGCGTGGCAAGGAACTTCTTCATGGATTGGTCACCyVGTAGGTTGACGCCGAGAACT 

AAGGTGGCGGGAAACCGTCCTACGGTATTTGGCGGCAGGCGCGCCGCCGGCGGGGCGGTGGTAGCGGCCGGO^CTTGGATCC^ 

GTCCACCGGTCTTCCCGGGGCGGATGCATACAGGGCGTCCCGCTGCTCGATCAGGATGTCGGAGGTGGCGTCCAGGT^ 
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CGGATCCAGGGACTCGCAGGGGCTGGAGAAACGATGTCCGTTCCAGCCGATGTTGACCTGGCCGAGGCCGGAATCGACGCTCTT 

GGCCCCGTAGAGGTTGATCGCCGCGAGTAGCGCTGTGCAGGCGGCGGTGCGGGTCGCGTAGTAGTAAGATTTCCCGGCGACGTT 

CAATGTCCAGGGCCAGGGAACGTATTCGCCGCGCAGCAGGACCTTGCTCTCGGTCAGCGCGACCGAGTAGAGCACC 

AACCCCTTTGGGCAGGGCGATTTGCTTGTACGCCGGCGGTGGATCCACGGCGGCCTGAGCGATGTTGACCAGGACCAGCAGGGT 

GGCCAACTGGAGGGCTCGAACTACAGACGTTGCCATCGGCCGTTCACCTCCTGTACCAGGGCGGGAAGGGCTCCGCCCAGTCCT 

AGGGCCATCCAGCGGCCCTCGTCATGATTGAGCGTGATCTGCTTGCTGCGAACCTTCTTCGGGTCGATGCCGGCGAGGATTGCC 

CAGCGCCGCACTCGCTCTGCGTCGTTCTGGCTACCGACGAAGTAGAGGTCGAACTCCTTTTCTGCATGTTGCAGGTCGCGGACC 

CGCTGGATGCAGGCGGTGCAGTTGTCCTGGACGAACAGGGCCAGGCGCCCGCTGCTCTGCAACGCTGGGCTCATGTTCGGCGTG 

CCCGACGGGTTGGCTGTGCTGCTTTCGGTGAGGCGGATCACCCCCTCGCCTGGATAGGCGCGGGCGAAGGCTTCGTCGTATGCG 

CGCTGGTAGGCGAGTTCCTTCTCGACCCGGCGCCGTTCGGCCTGGACCTGTAGATCGGCATACCGCCGCCGTTCCTCTGCCGAT 

CGCGCCTCGATGCCCAGCGCGGTCAGCGGATCAATAC(^GGCGAGTAAGCGCCCCGGGGGCCTTGCATGAGCGTCTGGTAGC 

GTCCATTCCTGTTGGTTCAGTCCCCACTCCTGGGCCAGGTCGCTCTGAGATGTCTGTTGTTTCAGCGTGCCGAGTTGGCTACCG 

GACAGTTGGCTGCTGGCCTCTCGGCTTGCCGCCGCCTCCTGGCCGGCTGCGGGATAGCTCAGCAGCAGCATCAGGCCGGAC^ 

AGCGATACGGTTCTGATCATGGGCGGGGTCCTATGGTTGGATGCGAACGCTGCGCGAGGTGCCGGCGACGTCGAAGTGCGCGGT 

ACCATCGTCAAGGTCGGTCAGTCGCCAGGTCGTGCCGGCGACGGCATCTCCCCGGCGAATGAGGTAGATCTGGCTGAGCTGGGT 

GGATCCCGGAGGTGCAACCGACAGAAACCGTTCTCCCCCGCGATACTCCACGCCAAGGATCGAGAAGGGCGGGGGCTCCATCC^ 

TTTGGGTTTCGGCTTGGGCTTGGCGGGTATGGGGCGTTTTGGCGCTGGCACGATCAGCGGCGGCGCCTTGGCCTGCTTTTCTTG 

AAGCGTGCGGACAGAACCGTCCAGTGTCTCCACGGTTGCCTTGAGCACCAAGAGGTCGCCGGCCGAGGCGGTGGTCTGAGCCAG 

GTTCTCGACGGCATCGGAGGCCTGCTTGGCGAACGCCTGCGCAGCGTCAATTCGGTTGGACAGCGCCTGCTGGCCTGAACGGAA 

GTCCTCGTTGCTGACCAGGTGCTGTCCATCCACGGCGTCGAGGCGCTCGTCGACTCGACTCAAGCGAGCCAGGATCGCCTCCA 

CGAGGCCTTTTCCGCGGCACTGTCCACGCCTGATCGGAGCTGAACGAGTTGGTACTGCTGGTAGCTCAGCAGGCCGGCCAOT 

CGTCAAACAGAGGCCAAGAATCATTGATGCAGGGGATGGGCGTTTCATGAGGGTCCTCGCGCGAAGCTGGG^ 

ACGGACAACGTTGATGCAGTAGGAAATACTCGGCGGAAGGGGGGAGGATTTTTTGAGGGGAGAAATTCTTGCG^ 

CTAATGCTCTGTGCGAAGCTGACGAAGCTCGGGGGCTCTTOSTCGCGCATCAGGGCTGAACGCTGGATAGaVCA 

CTGCACCGAAAAAACCAAGTGATTGGCATCGCCGCGTCGCTGGCGGATCGACGATGGGGCGGACTra 

GTGAGGTCATTCGGCTTACCGGCAGTGCAAATAGGATATCCCGCTCCGTGACCGAGAGGATCAAGCGGCGCGG^ 

TGTCTGTGCGGGTGTGGTGGCAGAGAGTTTCCCGGCTGATATTCGGAGGTGGGGAAOUVTAGAACCA^^ 

AAATGAGTGCTGCT^CCCAACAACGCCGCCACAATGCCACTATCGATTTGACGATACCGGTATACGTAAAAGCATC^ 

TATGTTACTCACCCCAGACGAAGGTTAGGTTGACCATTGCGCTGCCTTCTGAAGTTATGGGGCyiTAGATTATGGCCG^ 

CAGAAAGGATGCAATGATGACAAAACTCTACTTTGATCTTCTGAACTCGCCTGCCGAGGCTCATTCGTCG^ 

ATCTGTGCAGGCAATCTCCACAACTGTCCCAATACTGGAGTTTCCTTCGGAAACCGTATACGCCTATC 

ATTAAGTATCGGTCAACGCATAGATCCTGCATTCACCCAGAGCTTAACGAGTGCCATATCa^CCTGGCAGGTCGCCCGATTG^ 

AGTAAGCGACATTTACCAAAAAATTCATGAAACCACACrGAGAACACCTGTTGAGATGGGCGTTCGTCCT 

TGAGGAGTATCAGGCCACCATAAATCAGCAAGCCATCTUVCATGGTTCAAGATATGCAGG^ 

GGCCCTCCAGGCCAATATGCAGTTCCTGTATGGACAGGAGATAAATACTGATTTCATCGCTCGTAATGAACTCGCT 
GAGAGCGAAAACCGTCXSCy^TAGTTCAGGGGCATATCACCJ^TCGGGTACGGCTTCGATACCrTC^ 

AAACTCTTTGAATCTTGTTGGTTCTACGCGACAGAAGGTATTACCTGCATTGCAGCTATCAACGTCCGACCCAGGCTTCTGGAG 

CGTCTATGCCTTGCTGGGACAAAGTCTCACGGATGACGATGGGCTATTACTCTTTAGTGCCAAAGCGCGAGCTGTTGT^ 

CATAGCAAGCAACCAGTTTGCAGGTAAGTGGAATGGGCTACCCCCAGCTATCAAAACXSGTTC^ 

TGGGCAGACTGGTAATTTTCCAAAATTTCAACAAGCTATAAATAGCCATGATTGGCCGGCAGTCATCC^ 

GAATGGTGTACCGAATGATCCTCTCCAGTTCyVTTACAAAAaSATTGGAAGAGCGAGCCAAGTATCTC^ 

TGAGCAATGAGGATATGGATATGAACAACACyVGTGAGCGAAAroCAAa^GATCAAT^ 

CCGGTCTCTACAAGGGGCTGGCTAACCAGTGCTCTCCTGGCCAGCCATTTCCT^GAGGTACAGCTTGTGGAGGCTT^ 

CTCTCGTACTCCATCCGGAGTTTGTGCCTAACGGAGATGTCTCGAAAATCGATAAGGAGTACGGAAOSATCCTTGCTGCTG^ 

CAGCTCAGGTTATCCTGCTTCAACTCCAAATGGCTCAAGACAAGGCTAAGGCGTGCGGG 

TCTCCTCCAATCTCAATACCATTAAGAGTCGTCATGGTGCTAATTATCTAAACCTGCTGAAACAATCACTOAACCGATACCCG^ 

CTAGCGTCGGAGTTGAGATCATGTCAGGTGGCAGTCaSAACCAGGATTCTGGAATCGAGG 

GTCTAACTGAATOVCAACTTCAGGaSAtGAATCTGCCTGCCAGTCTCAAACAGTTGCTCACTCAG^ 

CTCAGCCTGAATATTGGCCTGCTTACAAO^CATAGCCACTGGTATTCGTTATACAACCGGAGTGGCGATAACGTTG^ 

GGGCCACGGTTTAGAGATGGTTGAACATTAGGGAGAAAAGGATAGGTGCCCAGATCAATCTGCTCGTO 

TTCGCGCCCACAATGACCCAAGCTGCGAAAATACCAGCAAATGAGTACTCATTGGGGGATGGAA^ 

CCGGAAAAGGATGAGGCTCAGGOVTTTCTTATCCATAATGATGGGCCTAATGGGGCTACATGCAGCCTTAAAGGCACTCTra 

GATAATAAAGGAGTGGTGCATTCGCCGTATTCCTCTGCTTCATGTTTGCTAAGTATCACCCAGAC^ 

GTCAAACGTGAGGAAAATTCGCCAAGCTGCTCTGCATGGTGCG6TCCTAGAGTTTGGTTTGAAGGAGCCT 

AAGGGCTGCTACTATATGCAAATAAGGAAAAAAACTCGACAAATGTTGGGTATGATTGAGAAAAAAGAGCTTGATGCCGCTOT 

GCCTTATCAAATAAGCTTTTGTCAGACTGCGCAACCGAGCTAGCCTATCCTGCCAAGATATACTTGACGAACACACTTGCCA 

ATCAGTGCTGAAAAGGGAGAGAATGCTCGCTGTTTGGAGTATGCCCATCGGGTGCAAAAGCAAATTCCTC 

CAACCGGCTGAAGACtTGCTCCCGGCGGAGCTVCGCTTTCGCTATGGAACAACGCGCCAAGGCTGATGCTCT 

AGCGACGAGAAATAATGACGCXSTCXSTAGTAGGCGCCTAGTTCACTAGGAGaSATCTTTTCCGACTCTCCCXSCGGCA 

TCGTTCGTGCGGAAGTGCTGGTAGAGOSTTTGCCGACTGATGTTGAATTCGCGGGCGAGCTGAGCCTTGGGCTCGCaSG^ 

GCCCGCAGCCCCAGGGTAGCACCTGCTTGTCGGACAAGGCCTCTTTGCGGCCCCX3GTACGCGCAGAGCTTGATATCCTCGCGCT 

ACCGCGCTGGTGCTGCTTGTATGCTGCTCAGCAAGCCAGCCGCCGGTCTTTTCAGGGTAAGCGTACGGCCAATAC^ 

TAGGTTGAGCCGTATCAGCMAAGAATCGATTCATATGTGTGAGGTACCGCCTCATGCACATTGGCTTTC^ 

TCGGAGACCCTCGTGCCTTGGTGAAACACCATGTGCCAACCATCTTCGTAGGTTTTCCAAACAGAGCTAOT 

TOVTCGCCATTAGCATTTTGATGACGGCyVTCGGTACGTTGCTAACGCGACACCATCTGCAAGCACACXSC^ 
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ATTGTCCTTTTGATCCAAGTCTGGGATTTTAGTCCCACGATCACCTCCGCTTTCGTCCAGCTTTTGCCGATAGCTCCAAATTCA 

ACGAAGTCTTCGGTAAGCAGCTGTTTAAGCAGTCGATCATTCTTTCTCGATTCCGGCTCTAGCAACAGCTGTTCAAGCGCTAAA 

ATCTCCAGGCGAATATCCATTTCCCTCACCTACCT^GTATGATTTATATATGTTTCy^CAGGGGACAATAGATCT^C^ 

ATCTGTACCCACTGATGCGGCAGCAACTGGCTGATGTCTTTCGACCGTAACGTCGGCAGGCGAGTTAGCACGTCCTTCAGGTAG 

GCATACGGATCATGCCCGTTCATGCGAGCGGACTGGATCAGGCTCATGATAGCTGCTGCTCGTTTGCCACTGCGCAGCGATCCG 

GCAAATAACCAGTTCGAGCGTCCGAGCGCCCACGGCCGTATCTGGTTCTCCACCTGATTGTTGTCGATGGGCACAGCCCCATCG 

TCCAGGTAGCGCGTCAGCGCTCCCCAGCGTTTCAGGCTGTAGTCGAGGGCTTTAGCTGTGGCCGAGCCGTTGGGCACCAGGTCG 

CGCTGGGCCAACATCCAGTCATGCAGTGTTTTGCTGATCGGTACCGCCATTTCCTGACGTATTCGCCAACGGTCTTCGTTGCTC 

ATGTCCCGAGCCTGGCGTTCAACCTCGTACAAACCGCCAATTGAGTGCAGCGCCTGTTCGGCCAGTTGGCTTTTGTTAGCGACA 

TGCAGGTCGAAGAACTTGCGGCGAGCATGAGCCATGCAGCCGATTTCAGTGATGCCTTGTTCAAAACCGGCCTTGTATCCAGCG 

AAGTCGTCGCAGACCAGCTTGCCATTCCAGTCGCCTAGGAAGTTGCGTGCATGTTCTCCGGCACGGCTTGGGCTGAAGTCGTAA 

ACCACCGCTTTGAGCGCCGAAAACGGCGTCGTGCTGTAGGCCCAGACATAGACCCGGTGGGTTTTCTTCTCGCCTGGTGCT^G 

ATTTGCACCGGTGTTTCATCGGCGTGGATCACGTCCTGGTTCAGCACGGCTTCACGCAGTGCATCGACCAGTGGCTGAAGCCGC 

ACGCCAGTTTGTCCGACCCACTGCGCCAGGGTCGAGCGGGCAATTGGCAGCCCGGCGCGGCCAAAGATTTTTTCCTGTCTC 

AGCGGCAAGTGATCGGCAAACTTGGCCACCATCACGTGGGCCAACAAACCTGCGGTCGGGATGCCTTTATCAATAA 

GGCACCGGCGCCTGGATCAGGGTTTCGCACTGACGGCAGGCCCATTTGCCCCTCACATGTTGCTCGACGGTAAACAC^ 

GTGTAATCCAGCTTCTCGCTGACGTCTTCGCCGATGCGTTGAAGTTGGCAGCCGCAGGCGCATTGGGTATTTTCAGGTTCGTGG 

CGAATCACCGTGCGCGGGAACTGCGGCGGCAATGGCGAACGTTTCGGGGATTGCCGTGGCTCGGCTTGTGGCGAAGCTGGAAGG 

AGTTGTTTCAGCTCGGCCTCGATAGCTTCAAGGTCGGTGTCGAGCAGGTCATCCAGCAAGCTGCCTTGCGCCGAACTGATTTGC 

TCGCTGCGCTTGGC?WVCTTGTGGCGTTTGAGCAGAGCGATTTCGAACTTGAACTGTTCGATGAGGGTTTCATTG^^ 

TTCCTGCTCATCGCCTCGACCTGGGATTGCAACTGCAACGCCTGTGCCGCCAAGGCyVCGAAGCTGTTCCXSGGGTC^ 

AGGTTGGGCGAGGAAGTCATGCCGCCGATTGTGCCAGAGCT^GGCGCGATGCGACGACAAGTAGACCAGCCTU^TGGCCGGCGTT 

TACAGCATGCTGATCACACCGCCTGTGCCGACGCGCTGCCACGGCAGGCCCAGCACCT^GGCCTGGAGTTGTT 

TCGACCTCGCAGCCATGGCGAATGCCGGGCCAGTGGAACTTGCCTTGGTTCAGTCGACGCGCGGCAAGCCAGATGCCCACGC^ 

TCGTGCACCT^GCACTTTCATTCGGTTAGCCCGGCGATTGGCGAACAGATAAGCGCAGTGCGGCTTCGCCGCACCC^ 

ATTACCCGGGCTAATGCCGTCTCGGTGCCGGCGCGCATGTCCATCGGTTCGGTGGCTAGCCAGATC6CATCGATGCGGATCATC 

GAGCGAAAGCCCGGATGAATTGCGCGCAGCCCTCGGGATCTGAGGCTGGCTATTTCACCGTGATCATTTGCCCGGCCATGGGCA 

GTTCAATGAGO^CTGACGTTTCGGCTGGCCGTTTAGGGGTGGCCTTCAGCGGGACAAAGGCTGGTAACGAGGCTGGTAC^^ 

GGTCTCGATAAAGCGTCAGCCATTTGCGGATGACATTGGCX5TTGATGCa3TGGCTGATGGCGAa^CTGGA» 

GTTGCAGGCATTCCTGAACGACCTGGGCTTTGAAOSGTTTCGGGTAAGAGCTTCGTTGGCGCATGGAAATCCrGGCGATAAGGG 

TGATCGCGTCCGCTTAAAAATACGCGGACACCATCGCCCTTAATGCTGGAGTTCGGAAGGTGAGTTCGCCGGACGCTTACTTTT 

CAGGAGACTACGAGTCATGTGCCGATGCAGTAATTGACTTGCCy^TGAAGACAGTCAAGATTGACGC^ 

TTATCATGACATAGGTTCTAAAAGAGGCTCCATTGGAGCCTCGTATTTTTGTATGGTCGGCGTAGCACT 

TGCTTACCCATGAACTGTATTCCGTTAGATAAGGAGCTGACTTTAAGTGAGTTCTTGTAGCCATAGAACTCCCCAAGACCATCC 

CCGTCGGGAATTTCAGTGATTCCATTTGCaATGACCGTATGGCCGCCTT^CCTAGAAGCTTGGGAGCATAGOT 

TCTGAAACATAAAGGGTGGGTGAGCGATGGTCTCTATCTTGCTTCCAAGAAAAGTACGAGTGTATAACTCCTGCTAGTACC 

CTAGCCGGATCATTCCTGGAGAAGAGTACGACATACGGCraCCATCGACTCTCATGCTCACATGA^ 

CGATTCTCTAATTTTTTTATGAGATAAAGTAAGGTTTCCTCTGAGCTTTTTTCGAGAGGAGGTCGGCXST^ 

ATTTCGCCAGAGGAGGATGGCTGCATCCATAAGGGAGGGTATGCGCCTGAAATGAGTTGATAGATCGCTGACTTTATG 

AAGGGGACGCAAGGGAAGTTGGAGGAGCCAAATACTTGATTACGAGACTTTGCATG^ 

GACCATCCTTTATTACGTATTTCCTCTTGGATGTTOTTTGGCT^TAAACTGACAATTGACTCGATCa 

CTAAGCGAAACGGGAAGGTTTCXKSTCTCGCAAGTCTAGAAATCGAAATTTATTTAGAGTTTCT^ 

GCGTCGAATATATGiMTTGTTTGGATTGAGGTCrCTTTTGTTTCGATGAGGATTAGTCCX^ 

TCAAGAAAAAGGACTAAGGCTGGAAGTATAGAGTGCTTTGGGTTTCCAGTAAAAACGACGGGGCCATTCTCGGTATATTO^ 
TCTAAACAATATAGCGCTTCCAGTACCCCCTGGTGTGTATCXSTTGCGGCCACGGATCACCCAGAATGGAGCGCrTCTATTa 
CCAATGTGCTGCTCCAGTTTATCGACyVTCAGCGAGTGAGCGGAAGCCGaSAGAAACTGCTACCGCATTT 
GGATATTCTGTTCCAAACAGAAGAAAGGAGTTCTTCTGCAAGOSTTTGGCATCGCGTTTCM 

GACGCyVGTGAGACTACGAATAAGAGATAAAGCaVTGTTTTCCCCTTGTGCCGTTCGTGGCACTTACGGAGGTAAGGACTGGCTC 

GCTGTGG»CTTAGTGCCAATTTTGCGAGGTTCATATAGTGACAGTGCGTCCCTCTCGGGTTTCATGCGCOT 

CAAGACTGTACCCCATTTATTAGATGACGCACAAATCTTGATTTTTTTATCCTGCACTTCAGTGCCTATGTGGTGAG 

TTATTTTCAGCTGTCTCTGGTGCTGTTGCTAGGAGATGCAGGTAGTTCAGACAAGATGTGGAGAAGGACTCCCAGAAAA^ 

CACCCX3AAGGTGGCCTATGAGTTCATGCTGCGATGAGTTGTCGT6CCAACGCCATCTC» 

AGTCGAGACCTGACTCGGGAACGTCTCCCGACGTGCTCTGAGCCACAGCX3ATCTTCTTGGCCATCAGCTGTAAGCA 

GCGAGCTGCCGGCGTAGCCGAAGAACACCACCCTCACCGGGTGCTTCTGCCCGATCaSCCACGACCGCCXS 

GGGTGTACACGTTGTAGCCCGTTTGCAGGAACGCGATGGTCGGGAAGTCGAGCAAGTCCAGCCCGGTCTTCACCAGCT 

TGGTGATCAGCACATCGATGCCGCGATCGACCTGGTCGAGGATCCAATCCTCGCGTCGAGCGGTATCGACCGAA6CAOT 

CTGCCACCTTCAGCCCGGATTGCTCGAGCACTTTCTTCAGCCTGGACGTGGTGTCGCGCGTCCCGCTGTAGACGGTGTATG 

GAACCTTGCGGCCCTTCGCTTTCTCCTCGAAGCAGAGGTCCACCAGCACCTGCTCCTTGGGTATCT^CTGCTCGTCACC^^ 

TCGCTGGCACGAAGGCCAGTGTGTCCaSGGTTCGCGGATGCTTGACGATCTCCGGTOSGA^ 

GCACGTTGAGGAC»CGCCCAGGAGCGTGGTATCTCGTCGCGCCAGAGCCTGGCGGAGCTCCGCTGTC7VGCGTGGCCGCC^ 

GCTGATAGGCCGAAGCCTGCTCAGGCGCO^TGGGCACXSTCGATGAACTCCTCCroGTAGTCGGGCAGTACG 

CCTTGAGCTTCAGGAACyVCGGTGAACGGauVTACGAAGCGGTGGATGCCCTTTGGGCCGAAGCCGGGAGCCTTCACCGTG 

CCGAGAGCTTCTTGCCCCGCGCTGTCTTGTGCGAATCACCGTCGCGCTCGGTGTAGATATCCTTGAGCACACC^ 

TGAACX5ACATGGCTGCGGGAGCCATGCTGCCGCGCGCGTTGGGCCGATAGCCGTCCTCGATCATGCGCTGGGT^ 

ACy^GGAGATAGAACAGATCXSTCGGCGTAGCCGCCCATGAGCGTTCCGGTCAGCAGCACGGTTTTCCGTGCCTTM 
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CGCCCATGGCCTGGCCCTGGGCCGAGCCGCTGTTCTTGTACTCATGTCCCTCGTCCAGCACCAGCAGGTCGAAGTAGCCAT^ 

GTAGGTAGCGCTTGATGAACTCGGTCGGTTGGTAGCCGCCTTCACCGAAGCCGAACTCGATGTTTGCCATCGATCGCTCCATGC 

GTTTGGCCTGCCGATCGCTGAAGACGAAGTTGCCCTTGGCGTCCATCAGGTTGATGAACTCCGAGACGTTGTCCACCAACATCG 

TGGCCAGGAAGTCCTCGCCGAAGTCGTTCAGCAGGCGCTCCGCCCTGACCGGGCCGATGGTTGGTATCCGGCACATCGACTTGA 

GAATCGTTGCGCGCCGGTTGCCGCCGTCGGGCTTGCCTGGTCGGATCAGCGTCCAGAGCGCCCCACGGCAGGAGGAACAGGTAC 

GTCGACGGTCACCACGCTCGAACTCCTCCACCGTGACCAGGTTGCCTTCCAGGTCCTCGAGGACCTGTCCGCAATCCGGGCACG 

CAGCGAGCAGTTGGCCGCCGGCGGCGCGTTTCTTCCyVGCAGGCGAGCCGCCAGTGGAAACCCTVTCCGCATCCGCACGCG^ 

GGATGAAGAACTCCTGGCGCCCGTCGTAGGCGTCGCCCATCTGATCTCGCAGCTTGAGCAGCTTGAGTAGAGTATCTGGGCCAT 

TGAGTACCCAGACGCGGGCGGCTGGGATGGTCTCCAGGATCTCGCGGCGCCACTTGTAGACCAGGTGCGGCGGAGAGAC^ 

GGGTCCGGCGATAGCCGGCGGCGTGCATGACCGCTGCGACAGCGATGGCCATCATGGTTTTCCCGGTGCCCATCTCGGCGTTGA 

TGATTCCGGCCTGCTCGTTACGGTCCAGCAGCAGGGCGGTGATGGCCTGGACGACCTCGGCCTGGGCCGCGAAGGGCTTGCGCT 

TGAGTCGGTCCATCACCAACTGGCGGTGAGCGTTGACGGAGCCGGTATAGACCGGGGGGTTGGAGCGATTGAGCGACTCCAGGA 

GCTCGTCGCCGAACTCGTCGATGAAGTCGGTCAGGTTGATGTT<^GGTGGGAGGCGGCGAGGGCGGCCGGTTGGGTCAGAGCGT 

TCATGTGCATCTCCTGCAGGTGAAGAACGGAGCAGGGCGTGCACGGTCCCCTTTGGGTATCGGCATACCCTGCAGGGTGGAGT^ 

GAAGGAATCGCCGCGGGGGCGAGGTCAGTAATCGCTAGGGAGCAAAAGCGTGGTTGAGCTGCGGTCTGCCTCAGTGATGATCCA 

GAGCCTGGATTCGTCGCCGGCGTCGATGTCGTAGGACGAO^GCAGCCGATCGCCGAACTTCAGGGCTCTCTGGTTGGTGT 

GTCTTCCTGGTCCAGCGTTCCCCAATCTCCCGCCAGATGGCGTTTCAGCAAAGGCGCTGGGTTGAGTTGACCGGTTTCC^ 

GTGGCTCACGGCTGCGGTCATTACGACCCGGCCAGGGCTGATCAGC7^GCGGTGCGGGTGCGGGGGCaX5CTC^ 

TTCAGCTTCTTCCGTGGTCGCGGCCGAGGAGCTGATGGTCAGCACGCGGCCCTGATTGACCGAGGAGGGTGTO^TTTCCCATC 

CCGGATGATCGGGATGAAACGGTCGGTGAGGATCCTCACCTCGGTGATGTTGCCXSTaSTCGTCCTCGGTGAATTCGGTCTTG 

GACCTTGTCCTTGTAGGTGTCACCCTTCACGACCAGGATCCGGCCCGACTTCGATCGCACGACGCCAGATATOSaSCCG^ 

CAGGGCCAGGGCCAGGTGCCAGCGAGACAGCTCGCGGACTGGTGGGCGCGGCTGCAGCCCOSCTTGCGaS^ 

GTCAGGCCAGAGACCTCGCAGCCGCTGGATTTCTVCCGGCGAACTGCTCCGGCTCCAGGGTTACTCGGTAGAAGTGCTCCAGC^ 

GCTGGTGGCCGGCAGAACCACGTAGGGTTCCCACGGCCAAGCCGCTGGAATTTCCTCX5GCCTTTTCCTGGCCCGCTCCGATCGC 

CTGCAGGCGAGACCTCACCTGATTGGCGTCCGCCCGGGCCAGGTCCTGCCGACGGACCCGGATGCCGAAGATCACCACCTGCT 

GAAGGTAGGATCCGCGGCTGCGTAGATGCGO^GGCCGGTGAAGTGGTTGCTCAACCAGCCAGTCAGerCATCGTCC^ 

GTGAGGAACAATCAGAACCATGACGCCGCCGTACTGCAGa^CGGCAGGCAGaSCTGGTAG^ 

GCCGCTGCCCTGGTACTGCGACGCACCGGAGTGGTCTOCCACCAGGTCGCCATAAGGCGGGTTGAGCCAGAGCAGTCa^ 
CTGCCTGCTGATCATGGTGTCGAAAAGGTCACTGTGCAGCACTCGGTCTUVGCAATCCTCX^ 

GTACTCGACAGCGAGGGCTTGGACCTGATCGCGGCCGAGGGTGTGTGCTGCCTCAGCa^GGGCAACACCCTCACCGGCGC^ 

GTOVCAGATCCTCATCCTTCCCGACGGGGCAAGAGTGAGGGCCTGaVGAGCGCGTTC^ 

GCCGTTGOSTGCAAAGTTGCGaSCCAAGCGCGGGAACATGAGGGCCATGGGGCCTCCTTGAGTCTGAGAGC^ 

TGCGCCTGAGCATCGGTGGTAAGAATGCTTTCGCGGATTACCTCACCGAGGGTGGGCTCAAGCACGTCGACCCGCAGC^ 

OTCCAGGCXXAGACGTTGCCGATCGTCCOK^GTAGGGCCGTCAACATCTGGTGCTGGGTGAGAACCT 

CT^GTGCTGCAGGAGGGGGAGCXSGACAGGTTTCCy^TCACCAGCGGCa^GAGCCTCTGGTC 

GCGAAGGOSAGGTGATTGCCTCGGTCGGGCGCCGAGGCGCGCCGATCGAACAGCCAAAGATTGAGCAGGC^ 

CCGCGGAACTGACGGGTGGTGCXSTTTCTCCAGGAGATCCTGGTTTGGGAAGACAGGTAAGCGGCGGCCGTCCACGATG^ 

AAATGGTCGATGCCATTTTCTTCCOSGCCCAGCGTCy^GCCTGGCCAGGAACTCTTGTGTCACTC 

AGAAAGACCAGGTTGCACTGCTCGTCGCACACGCAGGCGTCGACGTACAGGTCTGGACACTCTTC^ 

GGACTGGGCATAGGAACCTCCTGGAAGGAGGAGCCACGCCGCCCTCAAGGGGCGGTGAAGCCCCTCGGGGTGTAGTCC^ 

TGCAGGGGAGCGTGGTGGCTACCGAACGCTCTTGGTGTGAGGGTCGAAGCTGAGTCCATCGGCTTGGTGCAGCG^ 

GATCAGGAAGACCTGGCAGAGGTACTGATCGCGTAGCGCGACAGCCGCTTCCGCCTGCTCCaAGGTCAG^^ 

GAACAAGTCGATGAGCTCGTCGTCGCCGGCGACGTCGTTGTTGGACAGTTGGTCCTCGAGGTAGGCGAGGGTTTCCTGGGTG^ 

GTTGGTGAAO^TGGATTCATGGTGTGCTCCTGGTGGTAGACGAGGAGCGCCCTTGAGGGAGCCCCTC^ 

GCGGTCAGGCTTGGTGTTOSTGCGTCAGGCGTTTACGCCATTCCTGGTTGATCTCATCGTGAGCCAGGTC 

GACAGCCGTAGTAGAACGGGCCGGCGCTTTCCGGAATGGTCTTGTAGCCCGATTGCCC^GCCGAGCAGTCC^ 

CGATCATGGTGATCXSAGTGACCGGCCGGCTTGCCGAAATAGAATCCGGCAAGGTGAAAAGTTCGTTTTACGACAGTCC^^ 

CATTGCCGGAGACTGTOTGTGCCAGCyVCCTCGGTGCTGCCTGCGAAGGTACTGGTTGGGGCCAGCAGCTC^ 

CCTTCGTCIX3ATGTGAGAAAAGCCATCCCATAGGTGTGCTCCTGTGGTTGGCCAGGTCTCCCTGA 

GTGGTAGGGTGTTGCTTCAGCGTGTTGCGCGCTCAAGGCGCTGGAGGAGCCGGTAAGCTGTTTGCCTGGCCTGCCAGTCG^^ 

ACT^CCAGGGCTCGTTGTCGGCCGCATGCTCCGGCGAAGGGGTCTTCACCCGCTCACGCAGTGCXS^ 

CGGATCGGTCGAACACTGCCGCGCGGAACTGGTCAGCCAGATTGTCO^GTCCGACCTCGCGCAGCGAGGCGC^^ 

ACTCCTCGCTGAAGCCTGGGGTGTTGTAGATCGATTTCATGCGGCCTCCTTGCGGGTCCACACCTGTTGTTTGAAGTaSAG^ 

ATAGCCCAGCGCGCCGAGTCGCACGCACTGTTGGaSTAGTCGCTTGCGGTCGACCGTGGTGTCCAGCTTGACGATTT 

CGGCCAGACCGTCCCGAACAGTTCTGCATCCGCGTCTGTGTCCAGAGAGGCGGCCTGCTCTGCAGGAGCCGGAGTGTCCATACC 

GAAAGGCGCGTCCCTGGTTGCAGCGAGGaVCAGAGGCTTCGACTTCTGGGGTGACGGCGACTTTGGTGGTATra 

TGTGGGGAGCTGGATCTCGGGCACTTCGTCGAGCGGATCCACCTCATTGCCGGAAAGACGCTCTACCTC^ 

GTCCATATCGTTCAGCGTCATGCTGTCCACTATGGCGCGGATCTCGATGACTAGCCGACCGCCGGCGGAGTAGTGGTGGGGGCG 

GATATTGGCGATGACGAAATCACCGCGGTACTTGCCTTCGGTGTGTTGATCCAGGAAGGCATCCTTGATGATGAACTCCCCGAT 

CGAAGTCCAGAGTTTTCCCACATTGAACTCGCCATAGCGACCGTTGATGGTTCGAATGGCCAACTGTCCGGGAACT 

GGTATGACCTGCGAAAAGGATGATCCCGCCGAAGCGGGATCGTGGTGAGGGGATGAACTACTCGGTAACCGGGTGTTC^ 

GCACTCTTCCTGO^CTCGATGCGCAGGTAGGAAGGCTCATCGGACGGCTCGACTATCGAGAGAACCAGa^ 

GTCCTGCCCCTCTTCCTCCTTGTGGTACAGGTGGAATTGGATGTGCTTCGCGGTCGGCTGGAGAGAAAGCTCCTGCCAGGCGGT 

TTGAACGACACTGGCCAGCCGGGTGCCCATCTCGTCGACCCTGGCAGGGTTGGCGAGATAGACCAGGCGCAGCa^ 

GGTGAGAACGATCGGGAGTTTGAGGTACGCAGCAGTGTGACCTGCGCTGGAGTGGTTGTTGAGAGAGGTCATGAGAGGTCTCCG 
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FIG. 21 

aatacgcy^ggcctctcccgcggggatgaagtccccgcgtgggtgggtaggatgcactccatcacttgcgtgatgcgaccgcgct 

gccgagtgctgtggcggtcttgggtgggttagcgcttgcgcgccggagccttaaaccctggctccttgggaggcgctgtcaacg 

acagcgagccaattcgaattcgggggctggcgggatttcccgccggccggattgctcaatcgtcataaagcggcaot 

cgtgggcgcgaatggatcaagtaccaggagccgaatgtctgcttgtcccgcgaggtgaagcagatcgacaaggtgttcgtcgag 

gccgaaggcaagctgttcttgtcgaagagcggctgcggtcatctccaggatctccagacactcatccgtccaaggggtgttgtg 

caagcgaaagcccacggcaccgtgctcgttacaccagaagagcgataccaacaccgagggctu^gcgg^ 

gaacctttcgtagaggctattccacgcgctgatgggtagatgtgtttttctgatctcccagcaggggttataaccgccgccgaa 

cgaaaggtatcgaacgacactctcggcttcggtcagtgagtattgatctccgataagcttttttctgccgtcctgctcaccgta 

gatgctgtaaagtacgtttctcactattccttggtgagggcatttttcaactctttcatcgtctaactcttggactaccacggc 

gaaatcgttattgaagtagcacagctcgcactgaaaatattcgtctgggaggtgctgctgggtttcatgcagagcgcggta 

cggtagattgttgccttcgtcagtgatctgcaggtttcgctcgatgcggaaactctcatagccgcgaataaacgggttgagttg 

agtcatgggatatctccggttgccgggaagcccatgcccttggaggctatggacctccaggtgagtaaggggcgccgaagcgcc 

caggtgatgtcgagttgttgcagttaacgcgataggtgttcttctttacgccgagccttctggtatcgctctaccaggccatag 

tattgaagtaccggtgatgtgcttgagtcgcaaccatgctcacagtcggcctgcgccaaagcttcgattgcat 

tagagctcgatactgagtccggcatcgccaaattcatcaacccggcctttgcactggatgtactcgaacctgctttgtgcagca 

tgcactccaatgttgagtagagctgggagcctttctggatatgtgttcatggtgagttccaattcgtaatcaaatcggagaggt 

cacctcccagcaggggtgatgaccccgtctgggttgaattagaagatgcacatgggcaaaagggtaaagcttaatattccagtt 

gcgtccaagtgccgtggtctaatgcctgttgggcaagttcgcgtgaggggtaatattcaaccgactcgcgcgaggctggtcctt 

cttcgtcagaagtgccgatgtagtaagcacggttcttattcataagtacctgcaacggcagttccaggctgagatgggcgagcg 

ccagctttccgtattgcatggtgattctcctttcggtaggaaacaccagtcccatggggactgaggtcccctaaatgggaa^ 

tttggcgattatgccagggcagatacctgggcattgttgttggccaggaacagccaggasaagttgcagaactga^^ 

agcagctcgtcctcgacgtcgctgaggctaacctcctcaacttccggcagggcgggcagttctacgatttggatgttgctcatg 

atgatctccagataatggagaacaccacgccctggggggagatggtgccccccggatgggtatgcgatgaagagtggra 

cgtaggtagtgaaaggccctcgggatgagggccggtattcaatcaagccgtgttcaagcagcggcttgttct^gcgtgct 

ggggctgcgggctcaccttgaggttcgcctttgccttgctgggctttttcagcttcttccttascatcgta^ 

tccactttgatccaggagatgaagagcagccggcctttgaggcttgcgtcgggcttgccttgtttctcgcctttctc^ 

aagggatcoscccagatatcgccgatacggaaggaaatcagaaccttcttcttggcctasactgcttccttgc^ 

agcttttcagcctgggcgccgaccactttgcagtcgatgtaggagtattccacgctgtctgccgcgccatggagggctgcgacg 

gttaoxscgaggaatggttcacctcggcggatcggtacctoscggatgcgattgaggtagccgataccggtggtc^ 

aaatacttggcttcttgggcttgggtgttgttgctcatgatgattctccagtttcgtaagcgaaagcggagaaacacct 

gtcagggaaaagtattccccgcatgggtcggtattgggaggtaaatcagagggagagggcttccatcaacctc 

atcgcgctaccggaagctggtgcgatcttgggttcgatctacacggaagtccgtgtcgtagtctttcagatcatgactactggg 

ggagggttgccgacaacxstctigcagctcacgtggtaaaggatgggcaggggaaaaatgaato 

agoscccagggttgggasctttcy^tggtcaatgatgacggggaagaatccgatgttttt 

actgaaccagagggctatgccggagcagataaccaccavggctggtcatgatgatcatgccgaactcggtctgggara 

cctcctcggtgagagcgatacctatgagccagattaccataccatgacgaaagcatatgccxstc^ 

ggaagatgcccaccaggctcgccgcaacggcagtggtaccgatttttcggagatastctgcgaatgcac^ 

yqqttttcctcgaaaaatgccggaaaaccaccctgcgggatggttcccggcagggttgggggtggtagcagtgg 

ccgagagcagctacttgttcatccgtcatggtggacaagtagtcctgccagcgttgttggtattgctcctgagtacgccgatgg 

agttcccaagagactgcgctaccaggaggtgctggcaaaagagcaatctcatgatcgttcagggactctaact 

ctccccatttcgctttcggagttgggtgcaatcaaatggtt^gggaacccatcttgagctccttcct 

ttggcggtcgccttggccggcttgggtttgcgtgcgcgcttgggcttttgcacctcgcgcgtattggctagataca 

aggttgaccttcttcagccggacgatggctgggatgtcgctgatgtgttctagctcgco^gcggtcgggccggcc^ 

ggatc^tgcgctggactcto^oscaaccggcx^cttgtgcrtgcactacgaccggagto 

otcacaacactgccgagggcaaagccgatttcgaagctggtggattgcttggacatggtgtttctcctgcagg^ 

gagaaaacactggccctggcggggaagtggattccccgctgggttgcggaagggagcttctatcgaccgaggccgaccgaccgc 

gctaccagaagctatgcggtcttgggttggato^cacggtgggctgtgctgcagcatttcggatcc^^ 

gtcgctgao^cgtcagcagatcaascgttcata^tcgaagcggggggcgaaagtgt 

gcagcgggcggagagcgagggcggaagagcgoxsctggtagaaaccctgtcaaaggtttccc^ 

caagtgagccacctgtggttgaatcgtcgatccctgggcattgatcgtctcgatcccatcacccggccat™ 

cagcaaacagtaggcacga^tccgostacaaagggagccctgastatcaccggcgggccacc^ 

ggtagcctgatagtcctggagcaggagcatcaggctacccatggagaggggaaaaggaggggccgtaacaca 

aaatcgaggaaacaccgaacctcttgagaaccctatgtgcctttaaactcgctcgacacgagttgacgg 

aagaacagggagacacagggagaattctcaaccagatccgcacgccccacccggtgttgtagcgtc 

ggcgaacctgcatcaccagggcgctactgatgacostcagtagccagcggggtgaaatcy^ 

aaggccggtttatcaagctgcgccggggtttactggcctgtggaggttatttgactocgaccaccgcgccctot^ 

tagggagcgct^tgacagtggctccgaggcggcgcgtgactaactggccgttgcgcctgacgco^cc^ 

cagcyvtcovcggcosccgccagccccgccggcggcgaggasatgaaggcgtagctggtgc^^ 

gcgctggcttgcggaaccaggccggtaaaca^gagosgagtccgccggcgaaccggtgaaaccggggttc^^ 

ccgccagactggaccggaagagcagcaagctcctggccy^gggtgtcgacctcggccagttcggcgctcgaggtcgcgt<^ 

tttgtacactcagccaggtcccggtgatgagcgcgagcaccaggacgatccacatcagcggcatacggtagtc 

gcgacctgctagttcacgcaggtaacggtcacgttggtcgagtcgggggtcttgtagggttccgggtagttgaagcctgggccg 

atcto^gtgattagcccgaaosagtagttggtgccaaatgtctggttgaagcccactgcgcagaatcggccgccgcaggt^ 

gcgtagcggacgtagtcgtagttcaccatggcgctatcctgaccat6caac6tacaccagtccttgatcacgccggccgtggtg 

ttcaggcg»tggaggcggcggttcgtcccc7vtttcccgttttggcaggacagccagagtca3gtc 
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FIG. 2J 

CCGTTCGGCGAGCAGTTGGCTCCTTCGGTGGCCACGCCTTTGAGCTGTAGGTACTCGCCGACTTCCGTCCGGCCCTCGGCGGTG 

AGTTTGCCCGCTTTCATCTCGCCGCCGGTGTAGACGTTCTTGTTCATCCAGGAGCGCACCCAGGTGCTGTCGCTCATGTACCAG 

CCGCCGCCCCATTTCTCGTTGTACCAGCCCGTGTCACCACGGGTCCTGAACCAGCCTCCGGTGTACGTCTCGCCGGCGACGTCA 

GCAGTGCCGGCTTTCACCGTACCAGTGGCTGTCACGTTGCGCGCGCTGATGTCCGCGCTGGTGGTGATGTTGCCGCTGGCCGTG 

ATCGCCCCGGCTGCGGCGATATTGTTGCCTCCCATGTCCAGCGTGGTATTCATCCGGTTGAGTTCAGGATGACCCGGGACGGCA 

TTGCGGTAGAGGTACTCGTTGGCGATGGCGCCGTCCTGGAAGAACAGCGCCGTCGCCAGATGTCCAGCGCCGGGAGCGCTACCG 

AAATTGCTTAAGGCCACCTGCCAGGCGGCGCCCTGGGCGATACTGGTGTTGGTTTTCGAGATGTAGCCCCCGGTGGCTCCCATG 

AGCTGCGCGATCCGGCGGATCGAGAGTTCGGAAGCTACCTGTCCACCCGTGGTCACGATCAGCGTTTCGAGCTGGTTGGCCGCC 

GGCTTGCGGGCCAGGACCTGGTATTGCTGGCCGTAGATGTTGGTGTCGCGGAAGCCTGCGGGCAGGTAACGGGTGTTGCGCAGC 

ATCGGGACGGTGATCACCGCCGGGGCCGTGGCGCCGGCGCTGGCCAGAACCGTGCTGAAGTTGTCCTTCAGGTACTTCTCCGCT 

GCCTTGGCCACCTGCTGTTGCTGCTGGGCCGCGTGCTGCTCGTCCAAGCCGTCCAGGTAGCTCATCAGGACCGATATGCCACCG 

GCGGTCGCGATGGCGATCACGACGAGGGCGATCATCAGTTCGATCGAGATGAATCCACTGCTGCGCGTACTCCTCATGGTGTTA 

TCTCCATAGGCTTTGCCGGGACTGGCGAGCTTGGTCGTCCTGCAGCAGATGAGCCTCCTTTCGGCGGATCTTGTCGCGCATGGA 

CGGGCCGTCGTCGCCAGTGAACAGCVIGTGGCTCGGCGGTCAGGCGGCGCGAGGAACCATCGCTCTCCAGCGCTTGGCAC^ 

AGCTTGGATGCCTTCGGCAACCACGTCGTAGGCGTTTOTCGCCAGTGGCTGGGCGAGCGTCACCAAGCGCTCGATCGCCTC^ 

ACAGCTTTTCGCGTGAATGGTGGCGATCACCAGGCTGCCGGTCAGAGAGGCCTTGCAGGCCTGGTAGG 

GATCTCGCO^TCAGCACa^GGTCGGCGCCGGCCraCT^GCGTGCGCAGCAGGGCCTCGCTGTATCCC^ 

TCTGACCTGGATGCAGCGGCCGAGCCCATGCTGACCGCTGAGGTTGGTTTCCTGCGGGTCCT^ 

CAACTCCTGCAGGCGGGCCAGGAGCAGAGAGGCGGCGGAGCTGGTCTTGCCTGTCGCCATCTCGCaSC^ 

CTGCAACTGCGGATCCATCAGTGCGGAAACCACTTCGCTCGGATAGCCGATCTCTTGGAACTCCCGC^ 

GCGCAGCACXSAAGACGCTGCCACTGAAAGCGTCTTCGAGGAGGGTGACGCGAAGCAGAACGCCGTCGAC^ 

AGGGTCCTGCAGCTCCTCCAGCGTTTGCCTGCAGATCTGGAGCAGTTCCTGGAra^ 

TCGGCGCGCGCCGGCGCCGGCCAGGCCTTTO^CXSTCGGCGAAGCCTTCACaSAGGTAGAGGTCGACGAACTG^ 

CGGGTTCACACTCACGAACTATAGGTCCAGGTAATGCTGTTGCTGTCGCrGCTGCAGGCCTGGGTCGCGGCTC^ 

TCTCCGGTGATCGAGGATCaSCTGTTGATTTTGGTCTGCTCGAAGGTGTTCTTOTCGATCTTGGTGGCCA^ 

TCCTGGGGCT^CTTGCTGGTGGTGATCGAGAAGCCCATGCCGGTGGACGAGACAGTGACCGATCCGCCGTAGACGTTG^ 

ACGCCGGAGGAGACACTCATGTTCTTCGGCACGCCGTTGATTGCGATa^GGCTGGGGATCAGGTTGGTACCGCTGGAG 

CCCGAAGAGGTCTTCAGGGCGCGTGCGTTGGCCGCAATGACGCTGATGTTGCGTTGTTCCTCGTTGGCGTTGGACGAACTGAAC 

ATTCCAGCCGCTGCGGCCAGGCCCAGGCCGACCCCGATGGCGATGATGATCAGCACGATGATCATCTCGATGGAAA 

CCCTGCGTCGGACGGGAAGTGCGTTGCGTAGTGCTCATGGAGCGCTCCTTGGTGAGGTTTa^GTGGTTCATC^ 

TGCTGTGCCTGGTAGGCGCCGAGCAGGACCAGGATCATCAGCGCGCCGATGAAGATCAGGGC^ 

GCOVGCTCGATCTGCTTGAGGCTGGTCTCCTGCCAGCGGCGGCTGAACTTGACCAGCGCCTCGGAGAAGCCTCCCCG^^ 

AGGATGCACAGGTACTGGATGGCCTGCCGGTCGGGGAAATCGTGACCGGCGCTGCGAAGGGCAACACCCAAGT 

CCCACGCCGTAGCGGGCAGCTTCCMGCGCTGCTTCAGCCAGGGCGGGGAGATCTTGATCATGCTGGCCAGGC 

CGTATGCCGGCGTTGAGCATGACCGCCATGTTCAGCAGGAAGGTGGTGCCCTGGAGCATGCGGTAGATGGACCAGGGC^ 

GTCCGGTCCAGCCyVGACCCGGCCTTTCCAGCGGTAGGTCGGCAACGTGACGATGACCACCACCGTGAGGGTGATGACGGCGACC 

AGAACGTAGATACCAGGTCCTGTGACGAAGCTGGCAATGGCGTTGAGCGTGGCGAGCGGGCCGGTCCAGGTGACTGG^ 

AGCCTGGCCAGGCTGGGGACCATGCGATAGGCCACGATGCACAGCAGGAACACCATCATGTC 

AGCGCCTGCCAGATGGTGGCGCGGATCCTGGCCTGGGCCTCGACCAGGGAGACGGaVTCGCCCATGGC 

CTCATCTCGCOSGCCTCXlATa^CGCTCGCTCCTGGGCGGGGAGATAGAGCGCCATGGCGG 

AGCCCCATCAGCGCTTCXSCGACTGGCGATGGCCACCGGATGAAACXKSATGCTGCCCCTCATG^ 

GCCACAGCATCCTTCAACGGGACCCCGTTTTCGAGCAGGGTGGACATGCrTTCGTAGAACTGCA^ 

TGCTTGCTGTAGAAGGCGAACTGCAACTGCTCCCAGAAGCCCCCCATCAGCACGCCTCCTGCGAGTAGAAGCT^ 

GATGCTCATCGAAGTCGAGTGGCCCAATGAAATCCTCGACCy^TCTGCGGGTCGAACATGCCCTCGTTGATGCGGC^^ 

GGGCGTGCTTGGTGATGCCCTGCATGGTCTTGACCCAGTAGTTGCGTGCCTCGGCTGGGCOXICTT^^ 

AGGCGAGGGTGGGCAGAACCACCTOSGCGACGATCGAGCGGCCGTTGACCCCGGAGCCACGGC^ 

TGACGTGAACCTGGGAAACATCGGTCMGCGTCGGACCCGTTCXSACCAAGTCGGGCGCGAGTTGGTCTTGGTGG^ 

GCACTTTGCAGTGGGGGO^GAGCTTGGGCAGa^GGCTCTGGTTGATCAGGCCGGTaVGCAGOT 

GGTCGACGCCCAGGTCCTTCAGGCGCTGGACAATGCCGATCGOSCTGTTGGTGTGCAGGGTCGACC^ 

TCGCyVCCACXSGAACGCGGCGAOVGCGGCGAAGAGGTCGCGTACTTCGCOSATCATC^ 

TGGCGATGCCCGCGGCCCAGGCCTGGCGTTCTGCGTCTGGGTCGGTGGCGTCGTAGACCAGTGGGGTCTGGTTGATGCCTTCGC 

CGOSAATGCGGTATTCCXKSCGGATCCTCGATGGTCAGGATGTGCTTGGATCCGCCATGGAGCrTGTCGAGGCCTT 

CCTTCAAGGTOITCGACTTTCCTGACCCGGTGGGGCCGGACAGGATGTTGATGCCGTAGGGCATAC 

GTGCGTTCTGCTCGGGCAGGTAGCCGAGCTGCTCCAGGCTGTCGAGGCCGGTGTCGTCGTAGAGCAGTCGCAGGATCATCA 

ACCCCCCGGCACGCGGGCGGGTGGCGATCCGGGCACTGAACAGGTTGAGCTTCTCGACGAAGGTCTGGCTCATCCGCGOT 

GGTCaVGTTGCGGCTTGi^CAGTGGCTCGGCCACGTCGCACATGGATTGGTAGATGGTTGCA 

TGCGGAACTGCTCGACGGTCTTCAGCAGGCCGTCGACGCGGAAACGGATCTTGCTGCCGGTGCCGGCGGGA^ 

GCACGTCACTGGCGCGCAGCTCAGTGGCCTTACGAATGATCTTGACCACGaSCTCCTGCATCTGGGTGGCGCGCTOT 

TATCGACCAGGCCGTCCATGGCGACGGCGCGGTAAAGCTGGTGAATGGTCTCCAGGTCGGTGAGGTTGAGCTGGTATCGGAAGC 

CACGGCGATCGAGACGGTCGATGAACGACAGAACGTGTATGTCGTTCTGGTGGCTCTCCGATAGATAGAGCGTGCCGTCGGCAG 

CCAGAGCCATGATTTCCTGCy^GGTGCTTCGATACCTCCCATTCACCACCGTCGGCGGTGAGCAGTTGGGTC^ 

GCGCT^GCGCGGCT^TCTGAAGGTTCGTCATGGAGGTGCCTACrGAATGAACGGCTGCGGTACAGCCCC^^ 

AACCGAGGCGCCTTGGGCOSTAGAGGAGGCTTGGGTGGGGGCAACGCTGGAGAAGCCCACGGGCyVCGCOT 

GGTGAGCACGACCTGGTCCAGCGAGATTGACTCGACGCGGTATTTGCOVGGCAGCTCCGCGCCGCTGGCXXSCGTC^ 

CCCGCCCGGGAACAAGAACGTGGCAGTCATCTTGCCGCCGGCGCCGTAGATGGTCCGCACCACTGGCGGAACCGGCTGCGTG^ 
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FIG. 2K 

cgtcggcggcgccgccggcaccggcatggcgaacgtctgcccctgggcgctgacgacctggccggtttcgcctttgccttccaa 

ttgccgctgcgccgtggccaggcgcaccttggcctcggtgaggatggcctgggcctggatctccgcaagttcgccaacggtgcc 

agcgcaggcctggccagacgcgaggaggaagagcacggccacggccatgccgatcggctcagttcgcatagatttctcctgtga 

(^gtccagtccaactggctgtccttgagcgtggtttcgaggttggtgatgcggacaccggcagcgggcagcccctggaaggtca 

ggtctgccgggagccgggtctgggcgctgaaggtgaatttcttcmggaaggcaacaccacttgctgttcagtcgcagcttcc^ 

cgccagggaggggcggagttgtctcctggctgatcgacagcttgggctcgaccccttgacggtacaggtggctcgtcagcgcct 

gcagaacgtcgtccgccggcagtagcggctcatcactgccgatggcaaccttcagatcgaccttcagggccgcggtgttgccgt 

tgtcgatgacgaaggcggggcggtcggcgaacaggtgctggctggccgctgtcaggtcggctgctgtgctgttgccggtacggt 

ggtaggtggcgaccaggaccctgccgtcgcacttgctggattcgaagagccagccctggatcgacagcgacagtacccccgttg 

ccttgctacaggcgcgtagcatgtcctcgaggtct^ggtatgagcgtccaaggcttctgcaatgacgccagg 

tggcctgggcgttcttctcggccagctcggcgaggcgcctctgctcctcgaggagcgcggcttgcctggcgagctcttcctggt 

gggcattccataggtagtaggcggttagcgacccacctaccacgct^ccgagcagggccactgccgtccactccctggcg^ 

agccgaaggtgagttgccggaggcggtagtcgcgccgcaggcgcttcggcgcgagcagttcctcgatgtcgaagtccttgacgg 

ggaaatcgaacccttcgggaacgaagacctgtgcgtttcgcagcacgccgcgcgtagagagcttcctgacccggtcccgggcct 

cgtcgagggtggtgaccagatcctgcccggggacaatcgcgccatcgagcgtggcgaccagcgcgtagcggtcctcgtcgactt 

tccagcaggccaggaagtcgccgtcgaactggcctgaaagcgccgaggccagggagtacatccccttgactgcgccttgcgact 

tcgaaacgaagccggcctggatcaccgtcggtgaatggcggatggcaacgatgtcavgatgctcttccttgcccagcttgcgot 

cttccttcatgtactgccgctggctggacagcggccgccagaacaggccgctgacgaacttgttgcastggtagctcaggatas 

agacgtctggtccyvcggctgccgaggtcaggcttctccatggttcagcccaga^cgaasggggtgat^ 

cgcgcttggtattgcgggtcagcccgccgccaagaccgaagaacccagcgtcgccggtgccgaccttgttggtgtcctcggtgg 

tctggtcgaagccggagagtaccaaggtctcgccgctgcgcagacgtaccttctggtcgaacagttgtatgtcgtagctcggga 

actgggctttggagtccccgctggtctgcatttcgaacgtcggccgggaggtcatgttgatgttgatcttcagcagcatct^ 

cgctttccy^tcacgaacggcy^gcaggctcatgttgtagccgctggtcacggcgcccgggatcagcgaggtggtactgcc^ 

gggcgacgttggagatctggctggaggccaggtagctgtcgtagcggccgatctggatcggcgcosactggaggttgagcgtgg 

tcacggacggggatcggacggtcgagacgcggccctgctgggccagcgcctggaccatggccttggatcc^ 

tggcggtatccaggatgctcacggagccggagatcgcgctttgatcgatgcccggcatggtgttcttcaggccgat 

tgttgttgagcgacttgtagaccaggttccagtcgatccccagttgatccttgtcggtcagggcgaccgagagcac6ttgacgt 

tcagcagcacctgcttggtgatgctctcgttctctcggttgaccaactgctggacacggttga 

cggtcagggtgcccgtggcacgcgacagcgacatgcgtcccatgctcggcgtcagcatcgagttgatgctgttct^ 

tgaggatcgatgtcttcagctccgagctggtggtctgcttgctgcosgagtcgccgctgatgccggagctgccattctc 

tggatccggagccgtccccgctgatgccggcggccgtcgtcataccggaacgcacggtggagtccaccgtgttgara 

aggcgtacatgcggaaggtccgagtgtccaggtagtagaactasacccttttctcggttgggttgtagcgcc^ 

ctokxxrggcgatgagatcgaggaacccgctgactttgccgttccatttgatgttgtacagagaccg 

agctcgaccccatgctgccgcasgcaccgagcgacatgccgttggcgacggaggcaggaaac^ 

raccttggatgggosgcggggasttgctcgcgcgctgctgaggttgcacggcgaaggcggccgggttcagcgra 

tactgaccgccatgtggcattggttgatgacttcctgggcggcctcctgcagcgacgctgcxscctgc^ 

tgcagtovctggacaaggtgtgcgaaacgcttaggggttttotgctgacccagggtttgtcggagaacaccacot 

gatccggtcgggtgttgcgcacctgcgctgcgatcgtagacgcggaatctgcostagcttcxscacacgatcctc 

acccgojtgaccgagcmctggccagcgcggggatcaggaggcmgccaaggaggttcttcag^ 

cxsctccttgatgatgato^gggactgcgggcggctggcgttcaccaggaagggccgttcggcag 

tcggataccgcgtcctcgaaggagcostggaaggtcagtggagcctcgatcggatagttgagatcctgcggctcccagc© 

gtccagcgtgcgoxrtttgcccaagcttccaaggtgtcgcgcagggttgatcccgtctcggctgaccaggccto 

cggctcggttccggcggcgtgagcyvccttgaccggtggggcggaagcavosgtgggtggagtgggcggc 

gaggcoscaggcttggtcggtgcasaagccgcgggaggagaagaggggtggccatccttggccg^ 

gtggtggattcaggcttcttggccggagcgtggctggttgtgatcggcgccggtgtcggcaccgaggcgaccacgat^ 

tccagcgtgctgactttctccgtggaggagggttgcgctaccgccxxscggagttggggcagcgggcttcg^^ 

accggtttcggcctcggcgccgggggaagttgatagcccgggcxscagcacgaagcagacctgccgcgcgacct 

acctgccaggctgggccggagaggacctggagggtgttgcgcagggtcatcgggccgagcttgtactgag 

cgggtgtagaggatgttcacatgaccggcgtctgccgggcacagcgagtaacccgagcggctcat^ 

ttgacgctcgggttcatgctcgacx3ggatggttacgtcgatgatctgggccat(^ggtcgcgttgac^ 

ctgaccagggtgtagcgcccatagcgcactaoksgctctttctccggctgcacac 

tctggattgcggctgctgtcgaaggctggcctggccggctgagccttgtgggtostgcagctggcgagcagcaggc^ 

agagtggtcaactgcctggtcattgaggccccctttgatgatggggcacacagtggcaggccatcc^ 

atactcggcaacaggggaggggatttctcgatgactaggcagagggtgagcgaggcaggcgtaaggga;^ 

tgaaggtttttcatgttacgcctaaagcgagccaggtgggaatcgttgttagaaatggataaaatagaagto 

gaatgctttgttgaactcaggtcagacagtacggggtgctaattgagctttaaatactattgggctaaatttttctggggagct 

ttcttctttgttttagtcgcttggaaaggctccgtatttccaagcctggcatcagttaaccccttggtagtggct6ga 

actatcctgtttcctttctcggtaaggcttgttgaagacttcgctttaaaatataak3aaaaagagttctgggtc».cag^ 

TTCTCCGAAACCCCTGCAAAAACAGGATTGTATGCy^GTCTTTTATTTGGCTTGTTATTTGTTTTCy^TTCCCTTGGGG^ 

TTTTTATTCTATAAATACGGAAAGGCCTCGTAGGATTTAACGAGGCCTTGCTTGGTTTTAAATGGACATTACAAGGTTGT^ 

TTTTTCTAGAAGGCCTTCGTa^Ta\TCGCCCCGGTAACTGCCATTACCAGTGCGAACCCCAGGATGCCTA 

TGCCGTGGCAAATGCAATACCCACAAGCCAACTGGCGGCCGCACCAGCAGCTAGTGTTTCAACT^ 

GTCCCCGGTTTCCXSTAGATATCTTGAACTCTTGGTACAGGCTGGCGGCGTCAATAGCTTTGCCTAaVACTCCAAA 

AAATTTCTCAAGGCTCTTCGCaVTCATCTGCTTGTCTAGAGAATCAAAGGCTTTGGCAATGGCTTGCCTATCTTTAAGGCT^ 

TTTTTTATTTAACX5CATCCTTGTGTTTTTCAAACGAATTGATTGCTTCCGCCGAACTCCTGATATTTTTCCCCCTGGCGCCT 
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GGCCAGTTGGTGCGCCATCTCTGATGTTCGTGCGCCAAATTTCTCAGTTACTTCCTTATAAAAGTCGGCAGTAAATTTAACA 

ATCTTGCAACGCTTGTTCCTCGGCCGCCTTGCGTTCTGCTTCTGTCCTGCGTCGTGTCGTTTCGGCGTTCAGCTTATCTAAGCG 

GGTCTGTAGACTGGCTATCTCATCCACAAGTAGCTCACCATTGTAGATGGCTTTCTGCTTTTCTAGAGGGGTTGGGGCATTCTT 

GATAAGAGTATTGTTTGTGGTCTGCAGCTGCTTTTTATCAGCAATTGCTTGCGTAACCAGTCTTGTGGCACGTTCAAGCCATTG 

CTCTGGCGTTGGACTGACCAATGGAGGCAAAGCACTTCCCGCTTGATCAAGCTCGGATTGTACTTTCATCTCGAGAT^ 

TCTGGTTTTGTAGTCCCGCTCCAGTTGAGAGAGGTCCTCTTTTTCCTTTAGCGCTTTTTGCTTGTTGAGCTCCTGTCTTAGCAC 

TTTTACCGCTTCAGAGAGTGAATCTTGTTTGTAGTAGGCAACCGCCCGCTCCAGTTCATCTCTAACAAGCTGCTGAAAAGGGTA 

CTTCTTTCTTTCTTCCCAAGAGTTCTTCTCCCAGATCTTGACCCAGGCCTCATAGATGGAGAGGCTTGACTGCTTTTCGTAGTC 

AATGAATGGGCCTTTGGTAAAGGTCTTTTTGTATCCTTCAAGATCAGATTCCATCTTGTGCAGGACTGCCGACATGACATCGAC 

TCGCTCTTGCCTTGTGGCCTCCCTTTTTCCTTGAGTGATCTGCTTTAGCAAGCCATCGACTTGATCAGATACTTTCTTGTTGAT 

CTCCTTCTTTTCTTTTGTTCCTTGTTCAACAATGGATCGGACTGGATAATAAATGTTTTTCTGTAGATATATGTCTCTTTGAAT 

CAAGTCGCCAAACATATTCCCGGCAACGGCCGCGCTATCGATAGGGGGAACCTCGTATGCGTATTGATCGTCTGGACCTTGTGC 

GACAATAACCATGGAACCAGGTACTTCGTTGTCATTGGACATTTAGACTTCTCCATTGGTGAGTGTGGTACAGAAA 

ATTCAGCTCTAGAAATTGAACCCCATGGCTTCAATGTGTGGGCAATGCACTAAGAAGGCACGTCAATGCGCTGTGATA 

TAATAAGCACGGTTGCTTAAACTCGATATTCAGGAATGGGATTGGCTGAGCCCACGGGATATTCCCTAAGAGTCATGTATCTCT 

AATAACGAGTTCT^TGCGCTCT^GATTCACGCTGGGTATGTGATGCTCTCTGCAAAATTCAGCTACTTGATCCGCAACACC^ 

AAGGGTGAAGATAGGGATAGCAAGCGAAGAGACAGTTTTGTGCGTTCCATGTTAAACGGGAGATACACGTCGAAGACCTAAGCC 

ATTAAAACGTGCTGACCATCCAGTTGGGGCCCATTAGGATCTAGCACAGCACCCATGGTGCACTGGGGCTATTTCTCGGATC 

CACCGTTTTGGTACAGGTCCGGAGAGAGCAGGTCTGGATTGCGGCTGCTGTCGAAGGCTGGCCTGGCCGGCTCGGCCTTGTGGG 

TCGTGCTIGCTGGCGAGCAGCAGGCACAGCGTGAGAGTGGTCAACTGCCTGGTCATTGAGGCCCCCTTTGATC^ 

TGGCAGGCCATTCGCCATGCTTCAGCAGGAAATACTCGGCGGAGGATCAGGGCTTTTTTATCCAGTTAGGCTTGCGGCTAAAAG 

CGCGCGGTTTCCGTCGACCTGCTCGGGCTCGATGAGGTCCTGGGCAACCTGGCGTTTCGCGTTGAGCAGTTGCCACAGTTGCTC 

GTCGATCGTGGCCTCGACCAGTGGGATTTTCACCACGACCATGCGGAGCTGGCCGTTTCGGTACGCGCGGTCTTCGGCTTGTTC 

CTGCTGACCGGGAGTCCAGGGCAGGCCGAGGAAAAACACGTAGTTCGCCGCAGTGAGGTTGTTGCCCGTCCCTGaS^ 

AGTGCAGATGAA(^CTCGGCAGTCGGGATCCTGCTGGAAGCGATCTATCGCCTTCTGCCGCTTGGTGAGCGAGTCATTGCCCAC 

CAGCGTGACGCAGCCGTGTCCGGCCTGCTCGCAGAGTTCCTTaVGCGCAGCCAaKSTCGGCTTGAACTCGCA 

CTTGTCCTCTGCGTCGAGCTCGCTCAACAGGTCCATCGCCACGCQAACTTTa^CCGTTTCCAGGTAACGCCGCAGCGCGCC^ 

TCGCGCGAAGACCGGTCGGTCCTCGAGGCGCAGCACGTCGTATTGCTGGCGTTCCTCGGTGGAGAGGGCCACCTTCAGCAACTG 

CCGCTGCTTGCCCTTGAGGCTGGGCAGCACyVTCTTTGCGCCTGCGCAGCATCCAGTCACCCAGCTCCGCC^ 

GAACTCCGGGTTGCCGGCGAAACGGTCGCAGAACTCTTTCAGCGGCAGTTGGCaSATGGGGTGGCCTGAGAGGCGC^ 

GTGCAGCTCTGTCTCGCGGTTGAGCACCGGCGTGCC6GTAAGCAGGTAGCGGTTCGGCACTTGGGCGGCAATGTCGAAACCGTG 

CCGCGTGCATTGCGCCGTCGGTTCCTTCATCaKrTGCGCCTCGTCGATGACCATCACGGCGAAGaSCGAAG 

GGGGCTCAACTGCTCGTAGTTGACGAGGATCCACTGCGCCTCTGGGGTGTCCTGCTGGATGGCCACGGTGGCCGAGGGATAGAC 

CTCCTGGATCTCCCGCTGCCy^TTGATa^GCAGGGTAGCCAGGGTGATGACCAGGATTGGTCTGCCC^ 

GGCGATGACCGCCTCGCGGGTCTTGCCCAATCCCATGTCGTCGGCCAATAAGGCGCTGGTTCTCTGmGC^ 

GTCAGGCTGGTGGGCCAGTAGAGAGTAGCCCTGAAGCGCCGCCTCGATATCAGCCGAGCTGTACTCGGTGCGCTCGATCTCCGG 

CACCGCGGCGAGGTAGATGTCAGAGGCTGACTCGTCCTCCAGGGATGGCGCTGCOKSTTCCTGCTGTGGACCGCCGATGCTCAT 

GCTGGGCAGCTCGGTCGACGGCGCGAAGCTGCCGTCGCTGAGCAGCTCCTGGACAGTATCCAGAATCTCGAATTGTTCCTCGGC 

GAGGCaU^GCTCCAGAATCAGGTTGCTTCGGAGCAGCTCGGGAGTGGTGTCXSATGaSCCAGGAACTCGATGG^ 

CCCGCGCATCOSCCGAGCCACCGCCACAACACCAGGATGGAACTCXSCCTGACAGTAGGACGCCGCCC^ 

GCGCAGCAGCATCCCCCAGGTGAAGGCCTGTCXSGTCAAGCXSAGGCCTGOTaKSAT 

CTGGGAGTGAAGGCCTCCAGCTAGCTOSGraAGACGGTTGTACAGAGTGTCCAGTTCAGGCAGC^ 

CCAGTAGCGGTGCAAOKSATGCTCCCGGGCGCrrGATGAAGAAGCCGGACAGCTCCAGCAG 

GGGGATCTTCAAGCCGAAGTraTAGCCXSTCGAAGATAGCCCGCTGCAGAAGCGTCGAAGGTGTCGCGGAGGTA^ 

GCGTGGACGGTTCATCGGTCTTCCAACTGAGTGAGCAGCCACTTCGCCGTGGCTTCGGCGAGGCCCGCCTTCCAG^ 

GCGAAGCGCGTCATGTGGACGGTCATGTCGGGCCAGTTCAGGTTCX3Ta3CTATGGCTCCAGAGCGAATTCGTTCC^ 

CGGTATGCCTGGCCGGCAGGCGAGTCGGTC66CACGGGGGCATTATCGAATTCGCGCAGTTGCACCGTGGAAAGGTGCTTGGGA 

CTCCAGGGCCTACCTATAGGCTCGCCATGCTCACGCCTGAGCTCCAATTCGGCAGAGGCCAGCGGACGACCAGCCTCACG 

GAGAGCAGTCGGCAGTCTCCTCX3GTAGGaX5CCTGGTCGTAGCTATCGATGCAATGTGCCAGAGCGATATO 

TCGGCAGGACAGCGGAGCTCGACGATCTGCAGTCCATTAGGGCAATCGATAAGACCGACAGACAGCyVT 

TACTCGTCGGTGTCGACCAGATCGGGCAGGCTGCTCTGCACTGCGGCACGCAGCTCCCTCTGGACCTGGTGGTAGTCGTCCACC 
AGGTTGCCAGCTTGTCGGTAGGTACAATGACCCAGATACGCTTTCAGGGCGCTGCGCGCCTGCCGAACAACCTGTGAGTTCCCC 
TGGTCGAGGGCGGTATAGAACTCCTTGATGTCTCTCAGCCGTGCAGTGATTGCT^GCCATGCCGGATCCTGCC^ 
CAGCCGTTGAAGAGACGGTTGTAGTCGGGCTTGGCGTTGTGAACTTGCCAAGGGATCGCATTC 

CATTGAGOSCGAGTGATCGGCCGCCGGTTACCTAGCGATGCCGCCAGTAGGAGAGCATGCCATCCCGAATGTCGGCCTT^ 

TTGCGATGGAAGAGAGCTCCTCCCGCACGGCCGGGACTGACGTGACTAAGAAATCGAATCGA6GCCCGCGGCGCCT 

CAGGCGAAAAAGTTGATGACCGAAATTCCTTCGTCCGCGACCTGCCCAAGAATACGGCCO^TATCGAGGTAO^GTCGCA 

ATGGCGGCGGCCTGCGGTCXSTTCACTGTCAAGCTGGGGAAAGGGGCAAGGAAGGTAATGGCG^ 

GTCGTCGTGTGAGGCCAGACTCCCTCCTCGCAATCCACCAGTAGCXKSCGTCAAGACaSGCTGAGCC^ 

CGCACCTGGTCGCCTTGAGCCAGCCAGTTGTAGAGCTGGGGCGAGAGGCACCGAACaS^ 

TCCAGGACCTGTCTGAAGGTTCGCAAGACCGCTCCGATCTGCTTCAAGTAGAACTGGTGCCITGGCGCGTAGTAGaSCTTG 

CTGTGGCTCGGATACACGAGGCGATGTTGAAGTCCGAGATCGTCCCTGGCCATCTGTGTGGCTAGGTCGTGCGCTCGCTTTGTG 

TATGAGCTGAGTACT^GCAACTGCGTCGTaSAACTTTTGAAGGCGCTGTAGCAGCTTC:^ 

GCCGGGATACGATGGGCAGGCTTGCACTCCCACGCACa^TTAGTTGACCGTCGAATACCTGGAGT^ 

ATGGTGTAACGCACACOSTTGTCAGCAGCGACTGTGATGCACCAGCACCTGGTCXSTAGATGTGCTGCGC^ 
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AACTTGGTTCGAGCTGTCCCTGCGAGGGAGAAATGTGTTGCCCAGGTGTTGGCGACGGCGAGCTGCCGCTCGCTGTAGCGGGCC 

AGGGATGGATCCTGTAGCGGCCCGCTAGGGGGGGCGTTGTCGAGAGGTGCCACGCGCATGCTCCTGTGTAAGCAAGCAGCA^ 

TGGCCGGTCGGTTTACGCTGAATCCATGTGAAATGGTCATCTCAAGCGTGTTGGATTTCAAGGTTCTACAAGCTAGCTGACTGA 

AGGTCTTATTGGGATGGATCAGTAAATTCTCTTGTAGTCAGCCTCAAAGTCTCCTAGTGCAGCAAACTCGTCAAGGATAGCT 

ACTTAAAAGAATAAGATGTATTGTCCTCGCTGGGTTGTCCCAGTTGGCACCACCATGCTTGCCTCCGTGAAATAGATTGTTTCG 

CACGGTCTTAAGTAAGCGAGCGACTCTAGCTAAGTCGCTTTTGTCCTCATCTAATTTAACCGGACGCCAACCCAGCTCTCTACC 

GGGCAGGACTATTTGTTGCTCTGGACTCTGCTCGATTAGTGCTGTGGCTGATTGGGAAAGAGAGTATTTGTCAGAATGGTTTTG 

TACAAAATTTTCCCATCCCGGCTCTGCCCTAGCTCCAGGTTTGTAATTTTTTAAGTAGCCATTCTCTTTGAGGCTGAACTCAAA 

TCGCGAGAACCAATAGAAAAAATCAAAAGCTCGCTCTGTGATTTCGGCAGATATATTCTCTTTTCGCATAAACCCTCCTTGATA 

GCTAGCAGCGAATTGGGCCTGTACCATTTTGCCTGCACTGCGTGACTAACTATAGTGGCTATAGCCCGCGCGGGTTGTCGAAGT 

CAGGCCGGCCGCTCTTGGTGGGGTAGCTAACCTTGCACCCTGTTGTTCGATAGCCGCTGCAACCCCAGAAGTCGAAGCCGCCTT 

TGCCGGCTTTCTTGCGGTGAACCAGGGGCAGACCGCACTTGCCGCAAAGGAAGTCGCTGGTCGGCGCCGAGGAGATACTGGCGG 

TGGAGCTTGCGCGTGGTTCGTCCTGTGCGCTGGGGAGCGCGCGGAGTGTCTGCAGCTCTGTCTGTAGTAGCTGGTAGAACTGCT 

GGACCACGTTCATGTAGGTGTCCTCGCTGTTGGCGATCCGGTCCAAGGCGACCTCGAGGTTGCGGGTGAAGTCGAGGCTGAGGA 

AGCTGAAGTTGTGCTCCAGGAGCGCGATGGTGGCCTCTCCCAGCTCGCCGGGGATCAGGCTTCGCCCCTTCATCTGCACCTGGC 

CCTTGTCGATGATGTTCTTCACGATCGAGGCGTAGGAGGATGGCCGACCAATCCCGCGGCGCTTCATCTCGCCCACCAGGCTGG 

CGTCGGTGAATCGCTTGGGAGGGGTGGTTTTCTTCTCCAGGACCTCGCCGCTGTAGACCTTGAGTATCTGGCGTGGCTCCAGCG 

CCGGGATCGGGATGGGGTTTGAAGGCGTTTCGTTCTTCTGCTCCTCGGCGTCATCACCCTGCAGCAGTTTTCTCCAGCCAGGCA 

CGTTCAACAGCTTCCCTTTGGCGCCGAAGCGCAGCGGCTTTTTGTCGGGGCCGACGCCCAGGAGGGTGATGGTTCTCACTGCGT 

ACACGGCAGCTTCGATCTGACTGGCAAGCGCGCGGACTCGAATGAGCTGGTACAGCGCCTGCTCATCAGCAGTTTCACCG^ 

CAGCGGCCATCCAGTCGGTAGGGGTGATGGCGGGGTGGCCTTCCTGGGCGTCTTGGTCCGCTTTGAACATCCGCTGTTGCTCAA 

CACACTTCAGCCCCAAGGCTTTGGCGACAGCACGGATATCCGGCATCGAGTCCTTCGAGATATTGGGGTTGTCCGTCCGGTG^ 

AGGTGATGAGCCCCTGTTCATACAGCCGCTGGGCGACCTTCATCGTCTTGTCGGGTGACCACTTCAGCGCATTCCCGGCGGCCA 

TCTGGAGCGAGGAGGAGATGAACGGTGCCGGAGGCGCATCGGTTTCTTCGCTATCAATGCAGGTCTCGACGATGACT^TT^ 

TAGCCCCCACCAGTTCTGCGAGTTGACGATCCTGGACATACGGGAACTCCTCGCTGGCAAACACGGGCACTGGATCCCATTC^ 

CCGTCCAGGTACGGCCGTCGCTGGGCGAAACGAAGGTCAGTTCCACCCCGAAGTGCTTGATTGCT6TGAAGGCGCGGATCTCCC 

GCTCTCGCAGGACCACCAGGTACACCGCGACGGACTGCACGCGCCCGGCGGTGGTCGGCCTACCCATCACGCGCCGCAAC^ 

GCGTGACCAGATACCCCACCAGGCGATCGATGACGCGACGGCATTCCTGCGAGGCGACCTTCGGGAGGTCCAGGCGACGCGGCG 

AGCTGAGTTCGGCGGTGATGCATGACTTTGTGATTTCTTTGAAGGCAACGCGCTTGTAGTTCTTGATCCCGGCAGCTTGGAGGA 

TGTGCCAGCCAATGCTTTCGCCTTCGCGATCCGGGTCCGATGCGACGTAGATTTCCACGGCTTTCTGCCGCAGCTCCTTCAGCC 

GTGCGACGGTTTTTTCCTTGCCCGAGAGGATCTGGTAGTGCGGTTTGAAATCCTGGCCCACGCCGACGGTGAGCATCTGCGGAT 

CCTGCCCGTGAACGGGTAGGTCTCTGATGTGGCCGGCTGTCGCCTCGACCTGCCAGGTCACGTCGGGACGAATCGAGGGGAGAA 

GGGACCTCAGCTTTTTCAGCTTGCCCGGGGCTTCAATGATTACGATGTGCATCGGCGGAATCTCTCGGAACAGTCGGTCGAACT 

GTCGCGATCAGAGAGCGGGCCAACAATGCGAAATAGACGGTGraTTGTTGCAGTTTTCGGGGCAACTTTCCGCG^ 

AAAATATCCGTTGCCGATCAAATCGGACACAACTTACCGATTCCTCAAATGCGGTAATCTTGTCCGTGTGGCTGCTGC^ 

CTACGGTAATCGTGGCAACTGGGTCGTCGCTCGCATAGATTCAGAO^GGAAATGATCGTTCCTGGAAGGACATO^ 

CAATCATCCCGATTGGTCGTAGGAAGGAAAATGGTATGGATCAAAGCCTTTGCACATGCATGCCAACGCC^ 

AGGAGCTGCGACTGTGCCACATGTTAGTCGGTAGAACTTTCCCGATAACATTGATCGCAGGCGACCATTGGTTGAGCTATGACG 

gcagcgcctggtgggtcgatgcggatgagccokigaasgaggacgaggtggcggctctgttggtc^ 

cgtgctggtgcggataggaaagcgcatcaggtacacccaagcccctcctaactcgtggggccaaggccgtaggtgagccacgac 

ttactttgcccacaatgcgaaatggacggtgcgctgttgcagtcttcgggaggagctgaagggtaaagtcatcagaagtc^ 

tgctgttgctgctcctgggcaaaccctttcaaccaggacgcccgcat<:xk::aggatgactgtaggggcagtcgaacacccgtag 

cctttccgcgcggcctcagcaccaatagcccgtacagtccggtggggatcgttgaagtgtcgtctgtccagcttgtcggtatcg 

gtcgctctctttgacacactgcacctcgtactggtcgagatttccgattcggaagcccttgccaccagagactta^c^ 

cggccaaaattaaggatgctggtctatcagcttttgccgccataactgttcattgcggttgctccagtcgctggaatgcttgtc 

gaggctgcgggcggcgaagagaggttcttgccacctgcaacttcacgagtgggcgcgaaaaattaagaccggcgtttcaccgtc 

gcgtcggcaatgccttgggccatcctgatgaaggatgcgcgatcttcttcctccataagcgccatgcactctagaact 

tcgtcggcgctgaggagctgcctggto^cggctcatcgcgatcccgaaatgagcaatgtgagcctgcttgg^ 

ttcatggcggagtttcctgcggggggctttgtcxsaagctgcgggtggcmaaagtggtacttgccacc^^ 

ggcggccaaaaattaagccgcctgctacatggggatgtcgtcgtcgaaggcgctgtcgtagtcctgctgcgattgcgca 

ctgagcaggttgctgcgggacgttccgagactgttgatgctgggttggcaacagggtgaccrcgg 

ggaaatgoscgaggcttcgaccttcagcgcctggacttcttcgcctgactctttgtccggcc^ 

ttcgaccttgacgcx3catacctttcx3tgaacagttcggcgaagcgctggga^tcctgatgccaccattc^ 

gccgccgcgatccotgtagccaccttggca^tcxsggaatcgagttgtcgaacatc^ 

attgttgccgttggggaagctgcggtgctccggcgccgagccaatgttggcttcccagaagacgggggtagccatgatga^ 

ccagtcagtggtggttggtgctctgctgcagtcggcccatgaactcggcctcggcctggcccatcttttccgcgavct 

cctgcttgccgatggagtggatcaggctgatctgcatgttgagggtgatgcgctgcagctccatctcgtagaggtcttgc^ 

ggtattccxsgggtcttgtcctggcgcagcatttccgtca^caccccgacgcccatccgccgggatga^ 

gaaaggttgtgcgggcggatgtgcgctgctcgatgagtcgggtgggcagtagagtgaagggtgggcgao^ 

gtccctgggcaagttcaagcaatccctgctccatggcttcgcactgttccgccaactgggccagctccctatt^ 

gctttaaaaggccttttagtgaggcactgtgttgcagcgcctggaaggcagtttgttgc^ 

gagcgccttgctggtccttatcxsagctgccgcattccgatcagtcctcgagttcgacaaxstcagcgtc^ 

gtcagagcggcccosggtgatcggcggagcgaagttcgagcgtcgagtgccctcavggatgtcctgcgg^ 

ctcgtacatcttccgcgcggcttcggcgcgagcattgtttgcggcgaagtcgtcgcgagtggcgcccgagaact^ 

ggcc^vgaccaaacaggcttcgcagcaccgacgcaccttcgtcgagccagacctccatgtcgcggcgacas^ 
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CTGGGCCAGCAGGATTCGACGGGCGAGTTCGTCGTAGTTGGTCAGGAGATAGACTGCCTTGAAGCCGAGAGGGTTGGAGATGAA 

CAACGGGACCTTGACCGGTGTAACGGACAGGTTCTCGGAGATATCGAGGGTCGCGGGCAGCTTGGCCATGACGTCATCC^ 

CTGGTCGATGGCCTCGAGTGCGGCTTGGCAGCTCTCGACTTTCTCTTCGATCTGAACCAGCCACCAGTCGGAGTACGGATCGTC 

CTGTGCTGCCCCGCGGTGCATGCGATTCACGTATGCGCAGAAGCCGGAGAGGCCGAGGATGCTGTGCTTGGCATCCGACTTCTG 

CCGGCCGGTCCAGATGCGGGCGGCGTGATAGGTGTGAAGCGTCAGAGCGACATCGCTGCGGAGAGAGCCGATCTGAAGCCGATG 

GGTTTCAGCCATGGTGCTGTGTTCCTTTATAGGGAAGGGGGTCGGCCACATCGTGCTGATCGGGGTAGGAGCGCGCAGTAGGAA 

ATACATGGTTCGCCGCTGGGGAATTTCCCATGGCGACTCGGAGCAAGGCACCCAGTTCGCCGCCGGCGTCGAAGTCAGTT 

ATGGAGTGGTTCCAGTGGAACCACCCAACTCGCCGTCGGCGCCGAAGTCAGTTCCTGGTGGAGTGGTTCCAGTGGAAACACTCA 

TCTTGCCGTTGGCGCCGACTTGGGTATCTCTACCGTTGGTAATCTGGTTGGCAGTTCATGTAGTTGGCTTTGTTGCATAAAGCT 

CCCCACATCGCTTTTTCCGCCATTTCAAAGCGATACATCTCAACCGTAATGGTCAACACGGTTACTGAATACCCGAA» 

AGCATTAACAACAGCCTGAGTTCTCGGTCGTTCCACGCATCTGTAAAAAGTGCAGAAAGAACGAGAACGACCAAAAAGAAAGGG 

GCCATGCTCAAGAACATTGTTTGAGTTCCTCGTGAGTGGTGATGCCTCGCTTTGGCGAAGTGTTTCGGCCCGAAACACCCATCT 

TGCCGCCGATACCGAAGTGGATCCCTGGCAGAGTGATTCCAGTGGAATCACCCAGCTCGCCGGCGGCGCGGAGGTCGGCTCCTG 

GTGGAGTGATTCCAGTGGAATCACCCAGCTCGCCGGCGGCGCGGAGGTCAGGACCTGGGAGAGTGATGCGGGCCGCATCACC^ 

AAAACACGACGCCCGACCGAGACTTTCTCATGAATCATCACCCCGCTCGGATGGCACGCTCGATCCGTGCCTGGGCCGAATGAG 

GTCCTTGAGCCGGCTGAGGACCTCGCTCCCTGTCCGGGTATCCCCCCGGGGCTGGACCTGAGGCTGTGTAGGCGCTATGGGGCT 

TGATGGTGCCGGAGCACGAATGGGGCGTTCCGTTGCGGGGATGGTTCGCTCAGCCGGTTCTTCCGGAGCCCAAGAAC^ 

CTTGCCCTGGACTGCACGCTGGATGAGCGTCATGAGGTAGCCGAACGGATTGCTGACTGTTCCGCCGGCACAACGGTGCTGCCA 

CTGCTCGATGATGGGTAGCCGAAGATCCTCGGACACCCGTCTGAGCGCGCTTACTGCCTGGATCCGTTGCTCGGCCTCCAGTGC 

GTGCAGGAGATCCTGCCAGTTCGGATGGGCTTCCTCGCGTGCTCGCGGTTGTACTGGCTTTTTACATACAGAATCTTGTTTGTA 

TGTATACGTACTATATGAGTTCGGCATCCGAACCAAGGGCGTACTCGGCGGATTCTGTCX3GGGTTCGGTATCCGAACTCAGCGG 

TAGAGCGCATTGCTTGCCGCTGATTTCACTGAGTTCGGCATCCGAACTCGGGGTGGAATGAGGTAAATCCGACAGAGTTCGGAT 

GCCGAACTCTGCCGCTGGATCCGCTTCGAGCTGACTATCGATAGCCCATTGCTGGTGGTTCAAGCX3TCCCTCX3AG 

ATGGGTAGGGAGGCGGCGACCCACGTCCGGATCATCCCTGAACTCCCGCCyiGATGATCTGCCCGATTTCXSCGTATGGCrCGGOT 

GCCGTGACCGGTGGATTGGCTCAGCAACTGCATGTAGTCGGTGTCCAGCTCCAAGGCTTaSGCTGGAGAGAC^ 

AAGGATGTAAACGTTGCCCTGGACCTGTCCGTTGAGGTCGTTGCGCAGGCGTCXX3CCGAGGCTGAGCCAGCGGGTCAGACGAA 

GACCGTGAGTGCCTTGGCGATGGTTTCGCGCGAGGCGATCTTGCCCGGCTGCATACCGAGATAGGGGCGCAGTTGCTCGTATGT 

CGGGAACGCGGTCAGTCCGTCGTCGTtGATGAGCAAGCGGAAAACTTGCCAGCAGTTCaSCrCC^ 

CAGGAGGAGGCGCCGTGGAACGGTTTCATGAGGATTGCCGCTGAAAATGATGCCGGAATATTGTTGAGCCAC^ . 

TTCGGCAGCAAGCCGCTTCTTATACAAATGGGCCTCGAGATGCCCCGAGGCCGAGTCCAGCAGTGTCGAAATGGGA^ 

CAGCCTCATGGTGCTGTGTCGCCTTCCGACTGAACCAGGTTCAGTAGTTGTTGACGTTCTGTCTCAGCGTTCACCTGra 

GCGGATGGGGCAGAGTGGGGCAGGTGTGAGCTGACGCTAGCX3AGGTAAAGAGTGGATTGGGACTTTTGCAGGCCCGCCTGGC 

TGGCTCGATTTGCCAGACGGATAGAGCCCGTCTTTCAACCAGCTCTGGATGAGAGACCAAACAATGGCGAGGCT 

GGACTGTCTTGTTOSATTCCGGCGTTGATTTCCTCTGTGAGGATCATCXSaSATGTCCAGC^ 

ACCTGATTTTCCTGCATCAGGTGCTCCCAGCGGAACCATATCTGGCGCTCGATCTGTTCATCGAGCTGCCGCCAGCGAC 

CGCGGGTGGATCTCCAGCATCAAGCGCTTGGTGGCGGTGTCCTCCGGCGACAGACCGAAAAAGCTCTGGATCATCGTTGT^ 

GCTCCCAGCTTCAGTGCGCGTTCGATCTGCAGGTCTTCCTGC6CGCTGCGCTCGGCTGTGGCCAGGATTTTCTCCATGACGTCG 

ATGTTCACTCTGACATCTACCCAGGAAACCGGGGTATTGGTCy^GGATGCTCATGACGGCAGGCTGTTGG^ 

TCCGGCCGCAGCCCCATCTCGGCACAACGCCTGAGCTGGCCGTTGCGCAGGTGGTTGAGCACCTGCGTCAGGACCGC^ 

ATCGTCGACTTCGACATGTCGATCCCTCCCCAAATCGTGCGAAATCACTGGTCAGTAGCTGCTTGCT 

AGCTCAAGGTCAACCAGGCGTCGGGCAAGACGAATGATCaXSAACAGTTTCACC^ 

CAGGATGGCTTGTCACTTGTGGTGGGTTGGCCCAGCAGCAACTGAGCGAGGCCAGCTGAGAACTGGAATTCACCCAGTGC^ 

CGGCAGGTCTGTTGATCGTGAGTGTTCAGAACGACTTGGACCTGCCCGCTGACCGCTTGAAGCAGCGTCATGAGGTGC^ 

GAATCAGTAATCTCATGGTCTTCATTGGGCTCGCGATACGTGTAACCCACACCGCCCTGAATCCTGACGATCrCG 

CCGACATGCAGAGCGATCTCTTCAGCCAGATaGCGATGTGCTGTTOCAGGATCTCGGGGGTATCGATCGACCGCTCC^ 

CAGAGATCGGTGATCGGGTGCTVGGCCACCGACTTGCACTGGGATTGCCTGCAGAGCTGTTTCATCGAAGAC^ 

CCCTCGAGGCCGGCCAGCCGTTGGCXSAGTCTGCTGGATCTTAGTCGATACCGGGCTCAaSATATGGCCTGCC^ 

CGCTCCTCATCGGGCAATTGCTTTTGTTGGA»GGTGGTGGCGGAGCCX3GCGGGGGGAGGGGATTACCCGGGTTGC 

CTGGACGTTTGGGGATCTGCGGCGGGGCTCTGTGTTTGTGCTTGTCCAGTAGACGACGCAGCAGGGTTTTCCTGCCC;^ 

GGGAGTTGAGGTGGTGCGGCAGGTGTGGGCAGGTCGACCAGAGTGCCGCGGCGTTGCTCCTGCTGCCCGTTGGTAATCrC 

AGGATTTGGTCATAACGCAGGCCCAGGGGGCGCTTCATTTGGTCGATGAGTTCGTCCTGGAAACGCTCGAAAAT 

GGGCTACTGTCGAAGATTGCCAGGGTGTCCTGGAACAGCATTTCGAAGTCAACCCCTTCACCAGCGTATAGACGTGCCCAGC^ 

GAGGATGCCGACTTTCTGAGTGACAGGAGTTTCTCXSATTTGCGGTTTACCGAGCCCTGAATACJ^GCATCAC^ 

GCCAATACCTCa^TAGTGTCTAACATCTTACTGATATGGGATTGAGATACAGGGTAGCCGTCCGCrT^ 

TCGCGCTGGGAAATGCTTTCACCGCCGTTCTCTTGTTCGTAAAGAAATTTCGCCTTCnXSCACCCCAAC 

TTGAGGTCGCCCTTCAGATCGTTCTCGGCCAAATGACCGGTa^GCGCGATGATTTCGCCGCGCTGCTTGTCCCAGGGCTT^^ 

AGGCAGTCGAAGCTG/W^TAGCGCTCGTCTCCGGTCTCTTTGTAGAGGTCGTTGAGAATTTCCAGGCGAGTGTTTCC^ 

CGAATGCGGTATTTGtCCTCTCCAGGGCGTCGAGTCACTGGTGGCXXSCGTATCGAGGCCTCXSATGTCGGATra 

TCATCGTATTTCGGGTTTCGAGTGGTCCGGGGGTTATCGTGCCAGGGGAGGACGTCGTGGAGTGTCAGCCTCATGGGOT 

CTGATCGGGTCCGACAACTGCTCGAGAACCTTGGGCGGTGCGGTAAAACCAGGGGOU^GTAGCCGGTCTTTCAGAT^ 

TTGATCTTGGCCATACT(^TGCCTCCGCCACCTGAGGCGTGTTCGCCGCCGCCAGCTCAGCCTGCAGGCGCTGGTTCTTGGCX3^ 

AGAGTTCGTCGACCATTCGGTTTGCTCGCGCGAGTTCTTCTTCGAGCGCGTTGCTGCGCTCCAGTCGGTAGACGATCTCGCT 

TGGCCGAACGGTGGTTATCCGCGGCTTTTTGCTTTAGCTGATCGCGCAAGCCCAAGGGCATACGGACAACGA^ 

TTTCTCGGGCCATGGTCATTCTCCAATTTGGGCCTTGCGCCGCTTGAGCTGGTCGCGGAGTTGAGTGGGGAGGCTGCTC 

CAGCGTGTCGCGCTCCTCGTCGTACrCGATCCTGGAGCCCAGCAGGTGCGCCTCXSAAGCTGATCGACTVGGCCT 
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FIG. 20 

GGTGAAGCGGCGGAACTGGTTGAGGGTGCGCTTGTCCGCCGGGATTTCCGGCGACAGGCCGTAGTCCTTGTTACGGATGTAGTC 

GTAGAACGCCCGCGGTTGCTGGTCGTCCATCAGTTCCGAAAGCGCGTCGAGGGTCATCGGCTCGCCGATGCGCGCCTGCGAGGT 

GGCGTAGTCGACCAGCGTCTCGGTCTTCTCGCGGGCCTGTTCCTCGGCCATGTCCTCGCTTTCCACGAAATCGCTGAAGGCTTT 

CAGCAGGGTGCGCGTCTCGCTCGGCGAATCCACCCCTTCCTGGCAGCCGATGAAGTCGCGGAAATAGTCGGAGACCTTCCTCCC 

GCCCTTGCCCTTGATGAACGAGATGTACTGCTTCGACTGCTTGTTGTTGCGCCATTCGGAAATGTTGATCCGCGCGGCCAGGTG 

CAACTGGCCGAGGTCCAGGTGGCGCGACGGGGTGACCTCCAGCGACTCGTTCACCGCCACGCCTTCGCTGTGGTGCAGCAGGGC 

GATCGCCAGGTAGTCGGTCATGCCTTGCTGGTAGTGGGCGAACAGGACGTGGCCGCCGGTGAAGAGATTGGACTCCTCCATtt 

CTTTTGCAGGTGCTCGACCGCTTCGCGGCTGAAGCCGACGAAGTCGCGGTCGCCCTCCAGGTACTCGCCGAGCCAGCCXSCn^AA 

cgggtaggccccggactcgccctggaagaagccccaggccttgttcttggcgttgtagctttcgttgaggxasgccagcaot 

ctcgatggcctgggagtcgcccagctcggcgtcgcgcgcgtggagcacggccggggtgccgtcgggcttcttctcgatcaggtg 

gacgatggcgtgtttgataggcatcagaaagactcctggttcagggaacgctgcctgtosgctcgaas^ 

ttgagctcgccgcgtgcgtcgcttggcgtggaagaaatgaagatcgaggcgcagccgatcttgctgagcttcaagtggccgcct 

gcggagcggtcgacgctccagccttctgcttgagcgaaggccaccagccggcgaacgctttcccgggcgccgcgaaggaaacta 

cgcattcgtggtctcctcgctgttgagctgcagccaggtctccatctctgcaagggagcggggaaggctatcggtggcggcggc 

atccaggagacgtttcaccagcgcctgaaggctgcgacgatcgctctgccggctgcaggcavggtcgtcatgcgc^^ 

ctvtgcggacctctgtgcgcagttgagtgcatacgtcgtcagttgctactggtgccgttgacgcgacaggtgaccxscgac 

cctgaggcggttttcca^ctcttcccttgctgccgagaagcxsatggccyvcgacctacactccaaccgctggtccag 

ctttccgcagtcagggcagggctgtgcgccgctgccatccaggtgatagcotcagttcagacagatccacc^ 

caacttgagctcaagcatcagctcgactttctggaagtgctosagcggccggtaggctgtgaastc^ 

ccttgggatttctacaggacagcgcccagcxsgctctcxsatgcgcttttcxstggtcgcctgcct 

atgcgttgtcctcgccaaacagtcgttcggcaaggcgggaggctgccaccttgtcgctccaggtgcaggtcgacgctgct 

gaatggtattggtccagaaaccggcgccatcgaacttgacgatgcactgataggtctgggacgccttcatgactcgg^ 

gctgagtgcatcaccgacctgaacggccaccagttcgggcaactccggctcatgcccgggatagttgttggc 

gaactoscgcgcggcgtccttgctgcctgcaatgcxxsaacacggaxst^ 

osgcaccaggctcactttcagcgagcagctgaacaggtgtcgcgggatccatacc^ 

cggagcgcgcgtagtgaccaccagagtccasgcctgggtggtasagatttcgatgtaccccagttcct 

tcgctcavtcgaactgggtgosaccgtgctctcgtcccgcxsaaaggcgcgagatcacttggctgag^ 

gttgaccco^cgtgattggcatgaaccactgccgagtcggtgaacgcggccgaggcgtggatactcgga;^ 

gttcatcaaagacagcctccaactgatggacgtcgtagcttccgcagcaggggcagaccyvacacctcggct 

TGTTCTCGGCGGGGAGAACCTGGCGCACCAAATGGTGTGGGTGGTGCTGGTCACAATTCATGCACTGGT^^ 

gggaagtgttcatgcatgggctccttgccaggtccccggcgccaggtcggcyu^gcgggtctggttggcgatgaa^ 

gacggtgccgatcggaccgttgcgatacttgccxsatgatgatttcggcgatgcccttggcctcggtgttggggtggtagacctc 

gtoscggtacacgaacatgatcacgtccgcgtcctgctcgattgcgccgctctctcgtaggtccgcgcagttcggtc^ 

cgtccggttctctaggccgcgatttagctgggatagagctacgacgggacagtccatctccttggccagcgct 

ggagatttccgagatctcggtggctcgattctccaggcctgggcaactcatcyvgttgcaggtagtcgaccm 

gtgtccgtacttcctggccgcccggcgaaccttggcgcgcagctctgtcggcgtgaggttgccctgatcgttgatgaccagccg 

gctgcavtagtcgtttatgcgctggatcgcgccagacagtctgggccaatcctcttcttgcagttggcccttcatc^ 

caggtccaggtggccgaacagggcggcaagcctgaacagcaactgctctgccggcatctccatgctgtacacct 

ctgttggtcx3ctctggagc(3cggtgta3accaggttgagggcaaacgacgttttgcccatcgcggggcgggc^ 

gagatccgacttctgtagtccgccggtgagtgcgtcgagatccttcaggccagtcgggacccccgttaccgtcacgttc 

gaagcggtaatcgatggtgtcgacgatcttcgtgagactcttgttgatatcgacgaaatcggcgttgtggtggtcctggccaag 

ggccgaacagcttctgctcaatctcctcctgaacctavgagggatttgcctggtggttcgaggcggtcctggtgcagt^ 

cgagagacatcagctgccgcaggtgtgcccgatcgcgaacgatctgcgcgtaagcctcxsatattggccacggagggcgtc 

cggcgagctggccgaggtaggccagcccgccagcttctggaaggtcttcgatcgcttccgacacagtcacgaca^ 

catccttcgcggccaactcgctgatggcggtgaagatcagccgatgctavtgccggaagaagtcctccttctgc;^ 

cgacaatgtcccatgccgcgttgtccagcatcaggccgcasagtaccccttgctccgcct^ 

ccagaacactcacttcgagtcctcccggctgatctgcagaaacgtgcttcctctgccgcccggggagccgatgatctcc^ 

tttcacttccaccttggcggactcgatgatcaccgaaccgamtgagccaccgctttcgcgcgctcgatatccaggg^ 

ttcgto^gct^cccttcgatggtggcaaagaggtgattccgcagatcctcaatcttgttcttavtc 

tgcgtttigagcttgctgatgacgcggattgcgctcttcagctcaggtggatagcgatggatggagttgcggcgcatgot 

cccgggtgatgagctccaggttgtcgatgacgacgttctgcttgttgttgtccttgaagcagacgcaatggcc^ 

ggccgaagtgttcttcccagagcaggatgtggatgcccttccagtcccxsggggggatagccggtatccgagatctttcgttgc^ 

ggtagccatcagcgctgacccgcgtgctgccgactgggagccatgtgtggggcttttgccccttcttgaactgcgtttctc^ 

CGCGTCCTCGCGOXSGCAGCCCCTTCATGCCGCAGTTCCATGGGGTGTTTCCCTTCTTGAACCGAGTTGCXjCTGCCCCAGTCC 

GCaSGATCTTCTTGGAGAACTCAGGGCTTTTTTTCAGCCCGAGCCGATACGCTCTGGAGCAGATCTGCTGGAGCGTTTTC 

ACCTGGCTGCCAGGACCTCGTTCGGCATATCCGGATACAGGCGCAGCAGCACCAGTTCCTGTTCGGraGTCCATC 

TGCy^GCGCTCCTCTCGGTTGCTGAATTTGCGGCGGGCACGCACCAGGGCCTCGTAGGCGATGGTaSAG^ 

TTGGGAACCTCCCTGCTCGAGCGCGCCCTGAACTCGCCGGCTGACACCGCGGTGCAGGCGAGTGTTTTTGCCGTGCTCC^ 

TGCTTGCAGCGAGGCCTGGTCATAGGCAAGGGCCTTCGTAAGCGCCGCCGGCTCCAGGTACTTCAACGCCGGATGGTGTAGTTC 

GAGGTAGGCCTTGATGGCTGCTTGAGTCGACTCGTCCGGCCTGCCAGCAAATTCACXSTACCAGCGAACGCACrcCGA 

GCCTTCGACGAAGAGCTTGCTGCGACGACGCTTGGTCGAGAGCTTGCAGCGCTTCTGTTGGGCGAO^TGAGCCXSAGCG 

CACCAGTTGGTGGTGGAGCXSTAGAGTAGGCGTGAGCGGCCAGCTCAGGGCCGATCCCTCGGCCCATGM 

CGCTGGCATCGCGTATGCCGCO^GGTAGTCGCAGCCGAAGGAACTGGCGCAGATCCCGGCCAGTTCATGCAGCCATGGGGC^ 

AGAGCGCCTGGTTCGAGTGGGAACGCACGCGGTTTCCACCGTGTGGGCCGAGATCTCCGCATCGCTGAGGTCG/^CT^ 

CAGGATCGCGGCCTGGCGCAATGCGGTTTCCGCCTCGTTCGGCGTGGCCCCCTTGGCATTCGCCAGGGCCATCAGCTTCTCGAC 
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FIG. 2P 

CTTGGCGAGGACTTTTTGCTTGTCCATTACGCGCCCCCCTTGCAGATGGTTGAGCCCTGCTCGGTGGCGGAGCGGTCCAGGCGC 

TCGATCTCGGCCAGCACTAGGGCGCCGGCGCGTACCAGGTTCTCGCGCGCCGACTTAGGCTTCCACCACTCTTCGTCCCAGGGC 

CAGAAmGTGCAGAGCCATTCCGAATGCCGCCCGGCmGTCGGTGCCGATGCCGCCGGCATGGAGCGCGTAGCAGCCGG 

AGGGCGATCAGGCCAGCGCTAGCGTCGTCGTTGCTGGTATCGAAACCTTCGACGGTGATCTGGCGCCGGCGCTCGGCCTGTACA 

TCGAGCCAGGCCTGCGGAACAGGGGAGAGGTTCGAAGTTGGGTAAATTCGGTCTGTGGCCTGTCTTACCCGATCGACAATAGCC 

AGGGCTGCTACGTCGCCCAGTGCGACCTCCAGGGCGGCGAGCAGGTTGGCCATCAAGAACTGCTCACGCTCGTCCGCGTGGTTT 

TTTGCGCTTGGGCGTCTATTCCAGTCGATCTGTGCTGAATACCTCGCTGTTACCGGGTCTGGCTCGCAGTAGGACGGCCCGATG 

TGAACCTGGCAGTCCAGGCATTCAATGGCACAGTTCCAGGCATCGGTGTCGAGCAGCTCG^ 

CTGCCGCAGAGCGGGCAGGGCTTGAGCTTGGGGTCACTCATGGCCTGCCTCCCTGCCCGTCGTCGGTGCTGGGCTTAGCTTGGG 

AGGAGCCGGCGACGAG(^GGTCGGCCAGGTCCGGATAGTTCGGATGGGTCCAGAAGCCGAACTGATCGCGCTCGACCAGGT 

CCTGAATCATAGGTTTCATGCCAGGCCTCCCACGCTCATGCCGAGCACCGCATTTGCCTCGGCGGCCAGTTGCTGGCCTTCAGT 

GCTTGATGAGCTGCCGACGAAGAGCATCAGCCGTTCGATGACCCTCACAGTGTCCTGCGAGGGTTGGGCGAACTCGG^ 

ATCGATGGCAGTAGCTATGTTCCAGATCGCCCTGTTCATGTCCCGCTGGATCACTTGGATCGAACTGGCATTGGCCGCGATGGC 

CAGTTGCGATTTCGTCTGCAACAGTAGGTCGTGGGCGATTTGGAGCAGTGCGTTCTGCAGGGACTGTCCACAGGCA 

AGCGCGAACCGGAGCGACCAGGCCCTGCAACT^TCGCAGCT^CCTTCAGCCTCGATTTGCAGATTCTCGGAAGCACCGGC^ 

GCCTGCTTTGGCCATGTAGCCTTGCACGTAGGCGTCGAACTGGGTCTCGGCGATCAAGCTCAGTAGGAGGTTGTTGCGGTTCTG 

GAGGTTCATATGAGGCCCTCCGGAGCCGATTGCAGAGCGTCTGTCCAGAGCCGAAGGGTTTCCTGCTCCTCGCTGGTCAGAGCG 

ATCTGTGCGGTGCAGTCAGCGACCTTCTGCTTGATGACGCGCAGGCGCTCCTCGACTTCACTGCCATCGTCGTACGCGATTTCG 

TTAGTCAGATGGCTAATGAAGTCACATCGCAGGCTCCCCACCTGACTAGCGAAATACTCGGCTTCACTGCGCCATCCGATGAGC 

TGCTTTGGACGCTCCAGCCAGCGTTTGAGTTTTGAGTGGAGCTGGTGCGTGGAGTACAGGTGATCGAACACCCTGGCCCG^ 

TAGATCAACGCGATCTGCAGGATTTCCAGCCGATCTTCGAGCGTATTTCCCGGAAGCGGGTAGTTCGAGGCGACCAGTTGCTTG 

GCCAGGCGCTCACGTAGCTCGCGCCGCTTCTTGGCTTTTTTCTCGTAGCGATCACGCTTTTCCTTCGCATGCTCGATCTGCTTG 

TTCACCGAGTCGAGCAAGCGCGCTGCGTCTTGTAGGAGCTGCTTTTGTTCGAAGCTCAAGTAGCT^CCCAGC^ 

TCGATACCXX:CACCAGTCAGCTTGCGTATTTCTCGCGTAGCGCCGGTCAATTTGGCCTGCTGGCTCCGAAGGTCGGC^ 

CCGAACTTGTTCAGTTCGAGCAO^GTTCCGCGTCGACGTCGCGGGCTTGGGTGATAGGAGTCTTCGCCATGTC^ 

CTTGACCAGTGCTGCAACCGCCTCCCGGCGTCAGCGCCAGGAAGCGGTCAGTCCACTCGGGAAAGACCTCGATGGCC^ 

GAATGATTTCCAGCGCGGGCGCTGATGTGCGATTGGAGGGTTGCCGCGTTTCGAGGCGGTGCGCTGGTAGCCraCGCGATGCTG 

CGTTGCGAAACACGACGGCATCCGGGACAGTCGTTTCGAGCACAGAAATGTCCTGATGCTCATCGAAGAT 

CGTGAATTGCCCGGGAGTCATTGGTCTGGTCCAGGCAGTTGATGACGATCTGAACATTGGGGATCCGCATGCCXSA 

AGGGGCGTAGGCOTTCGAGCATTTGCATGGTGCCGCGGTTGAACTCACGGGCGGTAAGCATGTTGGGTTGGAG^^ 

CCAGGTCCX3ATGCAAGCAaVACCATTTCGAGCAAAGCTGAGCGCGCACCCTGGGTGT 

TCAGAGCGGGCATCAGGTTCGCCAGGCGTAGCCGGCCATCGGGCGCCTGGAGCAGTAGGTTGTTGAGCTG^ 

TGGAAATCACGAOSTCmGATTGGGGATAATCGTCCTGGAGATGATCCTCGCCGGGTCCGTTATGTTGGCGGC^ 

AAATGCCGCCCTGGGCAACTTCCGGCAGCTCGTAGTACGAGGATAGGGAGGGCTGGACGGGGTCCAGATCGATG^ 

GTATGCCTGCATOXSCGCAAAATGCACCGAGGTTGGCGGCGGTGGTGGATTTTCCTACACCACCCTTG^ 

TCGCTTTCXSCATTCATGTCTACAGCCTCTAATGTGGGGTCATTAGAGGCTGCTAAACAGACTCGTTGTAACGGGAGTGGi^ 

CTTCCTAGAGAAACCTACGACCAGTTGTCTTTTAACCAATTGGTCGTAGGTTCGAATCCTACACGACCCACCA 
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FIG. 3 

RL024 

DNA sequence (SEQ XD NO: 3) 

GTGGCGCTGACCGGTAATCCCCTCCTGAAATTGCTGGTCGTCCCCGTCGTGATCGGCGCCATC 
CTGATCGGCGTGAGCATGATGGGCAAGAAAGAAAGTGCGCAGTCACAAGGCGCCGCAACCCCG 
ACGGTAACGTCGGAAGAAGCGGCAACCCTGGGCATCGACGGCGACACGCCCGCCGACACACTA 
CGCACCATCGTGGCGGAAAGCCGGCAGCTCAAGGACCAGATCAGCAAGGTGATCCAGGAGAAT 
GACTCGCTCAAAGCCGCCAATGAGAACCTGCAGGGCCGCCTGCGCAACATCGATCAGAACATC 
GAGCAGAAGCTCAACAACACCGCCCAGGAACTGCAGCAACAGCAGGAAAACCGTAGCCAGACG 
ATCCTGGACCAGGTACAGAAACGGCTCGAGAACCTAACCCACATTCCCGAGGCCGGTGACACC 
GACCTGCCCGTAGGATTCGGCGTGCGACCAAAGGATGGCCAGCACTTTCAGGGAGCGGGCTCG 
TCTTCATCGGATATCGTCTGGATCGAGCCCCAGGACGCCCGCGCGGTTGATGCCAATGGCCAG 
CCGCTGGCCGCCGGCTCCACCACCCAACCGAGCGGATTCAGCTTCCCGACCTCCTTCGGCAAT 
GCGGTCGATCGCGGACAGAACGCGCTGGAGCGGATCGATGACGGGCTGCACCCCGTCGGCCAA 
CAGCGATCTGACCTGGAAAACCGCAAGCTCGTCCGTAAGACCTACACGCTGCCGCAGAACTCG 
ACGCTCATGGGCTCGGTGGCCATGTTTGCGCTGATCGGTCGTGTGCCGGTCGACGGGACGGTC 
AATGATCCTTACCCGTTCAAAATCCTCATCGGCCCGGACAACCTCACCGCCAACGGCATCGAG 
CTGCCGGACGTCGCCGGCGCGGTAGCCAGCGGGACCGCCTCGGGCGACTGGACACTCTCCTGC 
GTGCGTGGGCAGATCCGCAGCCTCACGTTCGTGTTCAACGACGGGACCGTGCGCACCTTCCCG 
GCGCCGGCCGAGGAGGTGAATGACAACCAGAGCAACAACAACCAGACCGCCAGCGCCGACCAG 
AAAACCATCCAGGGCGGCCTCGGCTGGATCAGCGACCCCTACGGCATCCCATGCATCGCCGGT 
GATCGCCGATCCAATGCCAAGGAGTACCTGGGCAATCAGAGCCTACTCACGGCTGCCGGGGCC 
GGCATTGCCAAGCTCCTGGACGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGC 
ACCAGCTTCGGGACGACCGGAACCAACAGCAACTCGGCCCTCAACAGCATCCTCTCCGGCGGC 
GTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCGCCGTCTACGTG 
CAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGAACTCAAG 
GGCCGCAAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAA 

Protein sequence (SEQ ID NO: 127) 

VALTGNPLLKLLWPWIGAILIGVSMMGKKESAQSQGAATPTVTSEEAATLGIDGDTPADTL 
RTIVAESRQLKDQISKVIQENDSLKAANENLQGRLRNIDQNIEQKLNNTAQELQQQQENRSQT 
I LDQVQKRLENLTH I PEAGDTDL PVGFGVRPKDGQHFQGAGS S S SD I WI E PQDARAVDANGQ 
PLAAGSTTQPSGFSFPTSFGNAVDRGQNALERIDDGLHPVGQQRSDLENRKLVRKTYTLPQNS 
TLMGS VAMFAL IGRVPVDGTVNDPYPFKI L I GPDNLTANGI EL PDVAGAVASGTASGDWTLSC 
VRGQIRSLTFVFNDGTVRTFPAPAEEVNDNQSNNNQTASADQKTIQGGLGWISDPYGIPCIAG 
DRRSNAKEYLGNQSLLTAAGAGIAKLLDADENNTSTVFSGNGTSFGTTGTNSNSALNSILSGG 
VSDIRQWMNKLYGEAFAAVYVQPGARVAVHLDQQLAI DYELKGRKVDYSSGAAHATADLD . 
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FIG. 4 

RL025 

DNA sequence (SEQ ID NO: 4) 

ATGATCCGGAAGTCGACAGGCTCGCTCTTGCTAATGCTTGCCCTACCCACACTGGCCCACGCG 
GTGGAGATTCTGCGCTGGGAGCGCATTCCGTTGGCCATTCCATTGACGGTCGGCCAGGAACGC 
ATTGTTTTCGTCGACAGAAACGTGCGAGTTGGGGTTCCTCGGGATCTGCAGGGCAAGCTGCGC 
GTCCAGAGTACCGGCGGCGCACTCTACCTGCTCGCCAACGAGCCGATTCCTCCAGCGCGCCTG 
CGCCTACAGGACGCGACCAATGGCGAGCAGATGCTCATCGATATCGCCGCCACCGAAGCAACG 
GCCGACCAACAGCCGCGCGAGCCGGTCAGGATCGTCGCCGGCGAGCCAGTGGATCCGCATTAT 
GGCCAGTCCCGGGAAGCCCAGCCATCGGCAGCAGCGAAACAGACCGAGCACGCAGAAGCACCG 
AAGGCCGTGCCGCGCGAAACGCCCGTCCCCGTGGTTCTGACGCGCTATGCGGCGCAGATGCTC 
TATGCCCCGCTTCGCACGGTGGAACCGGTGGATGGCGTCGGTCAGGTGCGCGTCAAGCGACAG 
CTCGACCTGACCACCCTGCTCCCCAGCCTACCCATCACGGCTACCGCCTTGGGCGCCTGGCGG 
CTGGACGACTACTACATCACGGCGGTGAAGCTGCAGAACGCCAGCGCCCAGCACCTGGCCCTG 
GATCCCAGGGACCTGATGGGCAATTTCGTCGCCGCGACCTTCCAGCACCCGTACTTGGGGCCC 
CGGGGCGACGCCTCCGACACCACTACCGTGTATCTGGTGACGCGCGGCCGCGGCCTTGCCGAC 
GCGCTCCTGCCCTCCTCCATCAGCCAGATCGATCCCAAAGGAGGCCGTCGTGGCGCTGACCGG 
TAA 



Protein sequence (SEQ ID NO: 128) 

M I RKS TGSLLLMLAL PTLAHAVE I LRWER I PLA I PLTVGQER I VF VDRNVRVGVPRDLQGKLR 
VQSTGGALYLLANEPIPPARLRLQDATNGEQMLIDIAATEATADQQPREPVRIVAGEPVDPHY 
GQSREAQPSAAAKQTEHAEAPKAVPRETPVPWLTRYAAQMLYAPLRTVEPVDGVGQVRVKRQ 
LDLTTLLPSLPITATALGAWRLDDYYITAVKLQNASAQHLALDPRDLMGNFVAATFQHPYLGP 
RGDASDTTTVYLVTRGRGLADALLPSSISQIDPKGGRRGADR. 
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FIG. 5 

RL026 : DNA sequence (SEQ ID NO: 5) 

ATGAGTTTCAGAAAACACACTGCGCAACAGCAGGCACACATCAACACCTTCCGGTTCATCACC 
GGCTTCCTGTGCATGGTCATCGTTGTGCTGGCCTACTGCGTCTGGGAAGCCCGTAAGGACCTC 
TGGATCCACATTCCGCCCGACTTGCGCTCAGGAAGCACCCGGTTGTGGTGGGACATTCCGCCA 
GAGAGCGTCTATGCGTTCGGCCTCTACATCTTCCAGCAGGTGCAGCGTTGGCCCAAGGACGGC 
GAGGTGGACTACAAGGGAAACCTGTTCCGCTACGCTGCCTACCTCACTCCCTCCTGCAAAGTC 
TTCCTGGAGAAAGACTTTGAGTTTCGTCGTAACGCCGGCGAGCTCAGGGGTCGCGAGCGCACC 
ACCTCGGAAATCCCCGGTCGAGGCATTGGCGAGAGCAATGGCCGCGTGATCCAGCACTCGATC 
AATGACTGGACCGTCAACTTGGACATGGACAGCACGGAGTATTACGCCGGCGAGAAGATCAAG 
CGGGCGCTGGCCCGCTACCCGTTGCACGTTATCCGCGCCGACGTCGACCCGGAAACCAATCCC 
TTCGGCCTGCAGTGGGACTGCTACTCCGACACGCCTCAACGTATCGAGCTTGAGGAGCCGGCC 
GCCCCCACCAAGCGGGAGGGAGGTCTATGA 

Protein sequence (SEQ ID NO: 12 9) 

MSFRKHTAQQQAHINTFRFITGFLCMVIVVL4AYCVWEARKDLWIHIPPDLRSGSTRLWWDIPP 
ESVYAFGLYIFQQVQRWPKDGEVDYKGNLFRYAAYLTPSCKVFLEKDFEFRRNAGELRGRERT 
TSEIPGRGIGESNGRVIQHSINDWTVNLDMDSTEYYAGEKIKRALARYPLHVIRADVDPETNP 
FGLQWDCYSDTPQRIELEEPAAPTKREGGL . 
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FIG. 6 

RL027 

DNA sequence: (SEQ ID NO: 6) 

ATGCCCGAAGAACATCTGTTTCAGGATGGAACCCTCAGCTTCCTGCCGACCCGTTTGAACCGG 
CAACCGGTAGTCATCGGCGGCCTGACCGCAGACGAAATGTGGATCACGGTCTTCACCAGCGGA 
GCAGCCGGGTTCGTTCTTGGCATCCCGGCTGCCTTGGTCGCAGGTAACGCTGCCTGCATTCCA 
CTGGGCGCGCTGCTGGTCGGCGCCCTCGGCCTAGGTATCGGCAGCCGCGTCCTGCGGCGGATG 
AAGCGGGGGCGGCCCGATACCTGGTTCTACCGCCAGGTGGAGATGGCCCTCTCGCTGCGCTTT 
CCCGTCTTCGGCAACCGTCGCCTGGTTACGCGCTCCGGCGCCTGGACCAGTCGACGCACGGAG 
TCCCCATGA 

Protein sequence: (SEQ ID NO: 130) 

MPEEHLFQDGTLSFLPTRLNRQPWIGGLTADEMWITVFTSGAAGFVLGI PAALVAGNAACI P 
LGALLVGALGLGIGSRVLRRMKRGRPDTWFYRQVEMALSLRFPVFGNRRLVTRSGAWTSRRTE 
SP. 
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FIG. 7 

RL028 

DNA sequence: (SEQ ID NO: 7) 

ATGCTGAAACTCACCCTCCAGAAACTGTCCGCCCTCTGCCAGAGCCTGGCCGCCATCACTTTG 
GCGCTCCCCGGTATCGCCTTGGCTGCACTCCCCAAACCCGAGGCACCTAGCCGTGGGGAGGGA 
TCGGGCATCATGCAAACCATCCAGAACTTCGGCTATGAGGGAGCGATGCTCCTCGCGCTGCTC 
ATCTGCGCGGCTGTCTTTCTGGGGGTCGCTTGGCATACCTACGGCACCTATCACGCCATCCAT 
GACGGGAAGAAGAAGTGGTCGGATCTCGGAGCGGGCGTAGCCGTAGGTGTCGGCCTGCTGATC 
TTGATCATTTATCTCGTCACCAAAGCCACCGCCATCATGTAA 



Protein sequence: (SEQ ID NO: 131) 

MLKLTLQKLSALCQSLAAITLALPGIAIiAALPKPEAPSRGEGSGIMQTIQNFGYDGAMLI^ 
I CAAVFLGVAWHTYGTYHAIHDGKKKWSDLGAGVAVGVGLL I L 1 1 YLVTKATAIM 
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FIG. 8 

RL029 

DNA sequence: (SEQ ID NO: 8) 

ATGAGCATGAGCGGAGCCCAGACATCAGCGTTCCAGGCCGCCGCTGGCTTTCCCCCATCGGCC 
GGCGAGGGACTGTTCATTGGAGCAGCGATGACCTTCCTTCTGCTGTGGTCCGCCTGGGCGATG 
TACAGCACCTGGCGCGGCTGGGCCACCAACAACCTTCGACAGCGCCACCGGTGGCGCTTCCGC 
GATCCCGGATCTTGGTCCTCCTCGGCATCACCTCTTTCTTCCTCCTCAGCTGACCCATACGGA 
GACACTCATGCTGAAACTCACCCTCCAGAAACTGTCCGCCCTCTGCCAGAGCCTGGCCGCCAT 
CACTTTGGCGCTCCCCGGTATCGCCTTGGCTGCACTCCCCAAACCCGAGGCACCTAG 



Protein sequence: (SEQ ID NO: 132) 

MSMSGAQTSAFQAAAGFPPSAGEGLFIGAAMTFLLLWSAWAMYSTWRGWATNNLRQRHRWRFR 
DPGSWSSSASPLSSSSADPYGDTHAETHPPETVRPLPEPGRHHFGAPRYRLGCTPQTRGT. 
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FIG. 9 

RL030 

DNA sequence: (SEQ ID NO: 9) 

CTGATCTGCACGAGATTCGCCGTGAACACTCCACATCCATCCCTTCGCCGAAGCTGCCTGGCC 
GTCTTGGCCTGCAGTGCGCTGGTCGCACAGGGAGCTTTCGCAGCGAGCGCCTCCGAGCAGGCG 
AACCTGGAGGTGATGATCCGGCAGCTCAACGCCCTCGAGGACACCGCCCGCCGCAGTGCCCAG 
GGCGCCGATGAGCCCGGACAGCGCTTCTACTTCGACTACCCGCGCCTGGCCGCTGACCTGCAG 
CGCATCCGCCAAGGCCTGCAGGACTACT^TGACGCCCAGCCGCGCCCAACCGCGTGACCCTTCC 
GACTTATCAGGGAATTACACCCTGCGCGGAGGGCCGATGCCATGA 

Protein sequence (SEQ ID NO: 133) ^ 

LICTRFAVNTPHPSLRRSCLAVLACSALVAQGAFAASASEQANLEVMIRQLNALEDTARRSAQ 
GADEPGQRFYFDYPRLAADLQRIRQGLQDYMTPSRAQPRDPSDLSGNYTLRGGPMP 
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FIG. 10 

RL031 

DNA sequence: (SEQ ID NO: 10) 

ATGAGCATAAAACAGCCCTTCGAATACCATGTCGAGAACATCGTCATTCCCTACAAAACCCTC 
ACCAAGGGCGTCGCGATGTTCAAACACAAAGAAGACACCTTGGAACCCGACGACCACGCCTTG 
CTCAACCCTCTGCGCTGGGCCGAGGTCGTGCGTCTGGGCCAGGAAGGCTGGGAGCTGGTGAGC 
GTTCAGCCACTCATGCGGGGCGTAACCGAGATCGGTAATCAAAACGCCCAAGGCTGGGCTTGG 
GGCGTCGCTCTGCCCGTCAGCTACCTGCTGTTTTTCAAGCGCGCAACCTCATAA 

Protein sequence: (SEQ ID NO: 134) 

MSIKQPFEYHVENIVIPYKTLTKGVAMFKHKEDTLEPDDHALLNPLRWAEWRLGQEGWELVS 
VQPLMRGVTEIGNQNAQGWAWGVALPVSYLLFFKRATS. 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 26 of 118 Customer No.: 21559 



27/118 



FIG. 11 

RL032 

DNA sequence: (SEQ ID NO: 11) 

ATGCTTAGAAACATCTCTATTGGAGTTTTGCTAGCCATGGCTGCTATGTTGGGCAGTTATGGG 
GTGGCTGCCGCTACATTACGATGCGGGTCGGCAATTGTTAGTGAGGGCGACTTGATTGATGAT 
GTGCTTAGAAAGTGCGGCAACCCTGATAGCCGTAAAATTGAAGGGCCCGCAGTGGATGGTAGT 
GGCTATATAGTGCGGGGGGCTGCTACTGTCGAAAACTGGGTATATGGACCAAGGAATGGATGG 
TACCAGAAGCTTAGGTTTGTCGATGGAAGACTAGTTCAGATAAAAGGCAGTATGGACTAG 

Protein sequence: (SEQ ID NO: 135) 

MLRNI S IGVLLAMAAMLGS YGVAAATLRCGSAI VSEGDL IDDVLRKCGNPDSRKI EGPAVDGS 
GYI VRGAATVENWVYGPRNGWYQKLRFVDGRLVQI KGSMD 
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FIG. 12 

RL033 

DNA sequence: (SEQ ID NO: 12) 

ATGAAACTTATCCTTGATTTCGACGGACGCCTTCTAAATCCAAGCAACATGCTAGAGGCCCTA 

TCAAAAGCAGGAAAAAATACAAGCATCAGCATAAGCAACGCGCAAGCATTAAATATAGAAACT 

CTTCTCAAGGCAACAACCACTGCAGAAAACACAAAAAACCTCTCAACAACTTTCAACGGCGCA 

GAGCTGACTGCTAACAACCTTCAGCAAGTCATAAACTCAGCAGGATCACTAACCAGAGTATCC 

ACAATAGCCGCACAAGCCATTAATATAAACACACTTCTTTCCGCAATATCTACAGCAGGCAAC 

TCAAAGAATTTTAGCGCAGAATTCAATGGAGCCCAACTCAGCAGCGACAACCTACTTAGAGCA 

GTAAATGCGGCAGGAACAAACACCAGCATAAGCGTCAATACCGCACAAGCGGCAAATATAACC 

GCCCTTCTTCAAACTATTCATGCAGCAGGTGACACAAAAACATTCAGCGCAGAGTTCAATGGC 

GCTCAACTTACTTCTU^CAACATTCAACT^GCTTTAGACGCCGCAGGAACCCGAACATCCTV 

AGCGTCAATACCGCACAGGCGGTTAATATAAGCACCCTACTAGCCCTCATCAACTCTGCCAAA 

GACACGAAAAAGTTTAGCGCCGACTTCAATGGTGCACAACTAACAGCAGACAACCTTCAGCAA 

GCGATCAGCGCTGCGGCCTCGGGTACCAATATCAGCGTCAACACCGCTCAGGCGGCGAATATA 

TCCACCCTTTTACAGGCCATCAACATCGCGGGCAACACTAAAAAATTCAGCGCCAACTTTAAT 

GGTGCCCAACTCACTTCAAACAACATCCAGCAGGCGCTCCGAGCGACAGGATCAAACACATCA 

ATCAGCATGAACTCCGCACAATCCGCCAACCAAAGCACTCTACTTGAACTTCTAGACATAGCA 

AGTTCCAGCAAGCAATTCCAAGCCAATTACAACGGTGGCATGTCTAATCCGAACAACCTACAA 

CAGATAGTTTTCCCGTGCAGGCGCCAGTACAACCGTGTTTATTTCCGACGCACAAGGCCTACC 

AATCGCAAATATCCTTACCCTTATATCATCTGCCGGATGAGACTTATAGCCGTGGATGAAAAC 

ACACCATCCACGGCTATACCCTAG 

Protein sequence: (SEQ ID NO: 136) 

MKLILDFDGRLLNPSNMLEALSKAGKNTSISISNAQALNIETLLKATTTAENTKNLSTTFNGA 
ELTANNLQQVINSAGSLTRVSTIAAQAININTLLSAISTAGNSKNFSAEFNGAQLSSDNLLRA 
VNAAGTNTS ISVNTAQAANITALLQTIHAAGDTKTFSAEFNGAQLTSNNIQQALDAAGTRTS I 
SVNTAQAVNISTLLALINSAKDTKKFSADFNGAQLTADNLQQAISAAASGTNISVNTAQAANI 
STLLQAINIAGNTKKFSANFNGAQLTSNNIQQALRATGSNTSISMNSAQSANQSTLLELLDIA 
SSSKQ 

FQANYNGGMSNPNNLQQIVFPCRRQYNRVYFRRTRPTNRKYPYPYIICRMRLIAVDENTPSTA 
IP. 
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FIG. 13 

RL034 

DNA sequence: (SEQ ID NO: 13) 

GTGCAGTGGACTCACGAACAGTCACCGATCATCCAGTCGAAGGCACCGAAGATCCTGGTGCGA 
GCCTTCGCAGGCACTGGCAAAACTACCACCCTGGTGGGCTTTGCCAGGTCGAACCCTACCCTG 
AGAATCCTCTATCTCTGCTACAACAGCTCGGTGGAGAAAGCCGCGAAGGGCAAGTTTCCCCGC 
AACGTAGTGTGCAAGACCGCCCACAGTCTGGCTCATGCGGTGTACGGCATTCAGTACGCCCAC 
AAGAAGACGAAGAACCTGCGACTGACCGATATCGCCCGCGGACTCGATACCCAAGACTGGGAG 
TTGGTACGTGACGTGCTGGCCACGCTGAACAACTACATGGCCAGCGCCGACGCGGAACTCGGC 
CGACCGCACTTCCCGCGCTTCCGCGACAAGGCGTTCCTCACCAGTGCTCAGGAACGCTTCCTC 
AAGCAGGGCCTGGACATGGCGCGAGTAGTCTGGAGGCGCATGGTCGATCTCCAGGACACCGGC 
ATGCTGATGCCCCTTGACGGCTACCTGAAGCTGTATCAACTGAGCAAGCCCGATTTGAGCCAG 
CGCTTCGACTGCATGCTCCTGGACGAGGGGCAGGACATCAACCCAGTGATCGCGGACATTGCC 
CATTGGCAGCGCATCAGAATGGCTATCGTCGGCGATCCCCATCAGCAGCTCTACCGGTTCAGG 
GGCGCAGAAGATGCCCTGAACAGCGACTGGATGGCCGGCGCCGAGGAGCACTACCTGACCCAG 
AGCTGGCGATTCGGCCCCGCGATCGCACACGTGGCCAACATCATCCTCTCCTACAAGGGCGAA 
ACACGGAAACTTCAAGGACTGGGTCCGCAGACGCTGGTGAAAAAGTCCCTCCCGCCGGACCTT 

cctcaccgcactttcattcaccgcaccgttatcggcgtcatcgagaatgccctgcagctggtc 
cgcaatcatccggagcccaaattccactgggtaggcggtatcgacagttactcgctgcgcgac 
ctggaggatctgtacgcattcagccgaggcctgcgccaaaacgtccagaacaagaaactgctc 
cgtgactaccgcgactacacccagtacgtggagatcgccgagatcagccaggacggtgagatg 
cttcgctcgatcaagAtcatatcgacctaccctgatctgcctgcgcggatccttgagcttcgc 
tcactgaccettgacgatgagctggacgcaacaatcaccctgaccaccgcacacaaggccaag 
gggctggaatgggatttcgtttgcctgtacgacgacttcaacgcggacccgctggcccccgac 
accgacccaggcaagcgcgacgatgagttgaacctgatctacgtcgcagtgacccgcgcgatg 
aagatccttggcatcaacagcctggtgctgtcgatcatgcagcggtacgtggacgacagaaaa 
ctgaaggagcagatagctagctgtaaaaaatga 

Protein sequexice: (SEQ ID NO: 137) 

VQWTHEQS P 1 1 QS KAPKILVRAFAGTGKTTTLVGFARSNPTLRI LYLCYNS S VEKAAKGKFPR 

NWCKtAHSLAHAWGIQYAHKKTKNLRLTDIARGLDTQDWELWDVl^TLNN^^ 

RPHFPRFRDKAFLTSAQERFLKQGLDMARVWRRMVDLQDTGMLMPLDGYLKLYQLSKPDLSQ 

RFDCMLLDEGQDINPVIADIAHWQRIRMAIVGDPHQQLYRFRGAEDALNSDWMAGAEEHYLTQ 

SWRFGPAIAHVANIILSYKGETRKLQGLGPQTLVKKSLPPDLPHRTFIHRTVIGVIENALQLV 

RNHPEPKFHWVGGIDSYSLRDLEDLYAFSRGLRQNVQNKKLLRDYRDYTQYVEIAEISQDGEM 

LRSIKIISTYPDLPARILELRSLTLDDELDATITLTTAHKAKGLEWDFVCLYDDFNADPLAPD 

TDPGKRDDELNL I YVAVTRAMKI LA I NSLVLS I MQRYVDDRKLKEQ IAS CKK . 
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FIG. 14 

RL035 

DNA sequence: (SEQ XD NO: 14) 

ATGTTCGGGTCGCTGATCGGCGCAATCATCGTGGAGTGGGTATGCCTGTATTTCTTCTGGCCT 
GACGCGGGCTGGAAGCATGCCCAGGCCATGTTTGAGTACGAACTCAGTTGGCTGTCGCAGGGG 
CTGCTACACAGCGTCGTCGTGCAGGAGCCAGGTCGAACCGCCACCTGGCTGGCCCAGTTGGCC 
TATGACTGGTTGTTCGTGAAGACCGGGATGGTCGACTGGATGACCAACATGACTACCATCGCG 
CAGGCCCGGCCACGGAGCCCGCTGGACGTTCGCTATCTCACCGCCCACGGTGTCTCCACGCTG 
CAGAACTACGGCCTGGCCGCGCTGTACACGGTGCTGACATTCGTCGTGCGCCTGGTGATCCTG 
GTCATGACGATCCCGTTATTCGTGATGGCCGCGTTCACCGGCCTGGTGGACGGCCTGGTGCGC 
CGGGACCTGCGCAAGTTCGGCGCCGGCCGGGAGTCCAGCTACCTCTACCACAAGGCGCGCGGC 
AGCATCATTCCGCTAGCGGTCGTCCCTTGGACGCTCTACCTGGCAATCCCCATCAGCATCAAT 
CCCCTGCTCATCCTGTTGCCCTGCGCCGCGCTGCTCGGCGTAGCGGTATGCATCACAGCATCC 
ACCTTCAAAAAGTACCTGTAG 

Protein sequence: (SEQ ID NO: 138) 

MFGSLIGAIIVEWCLYFFWPDAGWKHAQAMFEYELSWLSQGLLHSVWQEPGRTATWLAQIiA 
YDWLFVKTGMVDWMTNMTT I AQARPRS PLDVRYLTAHGVSTLQNYGLAALYTVLTFWRLVI L 
VMT I PLFVMAAFTGLVDGLVRRDLRKFGAGRESS YLYHKARGS 1 1 PLAWPWTLYLAI P I S IN 
PLLILLPCAALLGVAVCITASTFKKYL . 
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RL036 

DNA sequence: (SEQ ID NO: 15) 

ATGAAGTTGAAGAATTTCTTACAGCCTTTTGATAGCGGTTTCTCCACTCCGAGTGCTGCGCTCAAGCTGCTCCGCA 
TGCTCGGTGGCGCCTTGATGTTGTGCGTGCTATGCAGCCTGATATTCAGTGTGAGCATGGTTTTAAACCATCAGGT 
GTCCCTCAGTCGGCAAGCTATGAATGTGGCTATGTACGT^GCGCAGCTTTATTTCGAGCAGCGCGAGGCGTTGCTC 
AATCACTTGAGCGGCAATGTCGTGCCCTTGGCCGCGGGTAGAGCGCTCGTCAACGAAGCGCCGAACAATGTGAGCA 
TCCTGCCGTTGAGTGACGGAGGGCGAGGTCTGCTATTGACCGCTCGCACGCTCGGTGATCTCCGGGAAAAGCGGCT 
GGCACTGATGTATCTGGTCGATACCGACAAAGGCCCTCTGGTTTACCGGCTTACCGCCGATGGTAGGCCCTCGGCA 
GCGATATCCAGCACGATAACCAAAGAGGTGTACCGAGCCTTGCTGGCGACTCCGTCGGCGCCTGTTCACTGGGTGA 
CTGACGGTGGTACCCCTCAACGGCTGTACCTTTTTGAATCCTTAGGCGATGAGCCGGGCGAGGGGTGGCTAGGCCT 
GGAGATTCTCGGCGAAGACCTCGATTCGATGTTGCGCCGGAATGATGCCGGAAACTACATGCTGCTGGATCAGCAT 
GGGCAGGTCGTACTCGCTACGGACGCAGAGGCGCTGGGGAGCGGTGCGTCGCGGACGCTTTTGCGTGGAGACGGCT 
TCGGTTTCATCGGTGCTGGCCCACTGCCGCAGCATATGGTGCTTTTCCAGCACGTGGGGTCTTCGAGCTGGGATCT 
GATCTATCACATCGGTATCGGTCGCCTGTTGCTGGCTCTGTGGCTCCCTCTGTTACTTGCCTCTGCGTTGGCACTC 
GCAGTCGGCATCCTACTGCATTGGCTGGTGCGGAGCATCGAGCGACGCTTGATAGAGCCCGCAAAGCGACGCCTTG 
AAGCATTGAAGGAGAGCGAAGCCTTTTCCCGTGCAGTTATCCAGGCCGCGCCCGTCGCGCTGTGCGTGCTGCGTCG 
TGCCGACGCCGCAGTGGTCCTGGAAAATCCCCAGGCGCGCCAATGGCTGGGTGATAGCGAGGCGATTGCCCACGAC 
GCGCCGAGATGGATTTCCCAGGCGTTCGCAGGAGGTGTGAAGTGTTCTGGAGAAGAACTGGAAACCGAGGCAGGGC 
TACATCTTCATCTCAATTACACGCCCACCCGCTATAACGGTGAAGACGTATTGTTCTGCGCCTTCAGTGAAATCAG 
TGCACGCAAGCGGATGGAGGCGGAACTGGCTCGCGCAAAATCCCTGGCGGATGCTGCCAATGAAGCCAAGACGCTG 
TTTCTCGCCACCATGAGCCATGAAATCCGCACACCTCTGTACGGCATGCTTGGCACGCTTGAGCTGCTTGGGCGTA 
CCGAGCTGAGTCGGCAGCAGGCCGGTTACCTAAAGGCAATCCAGCATTCCTCGTCGACCCTGCTGCAACTGATCAG 
CGATGTGCTTGACGTATCCAAGATAGAGGCCGGCCAACTGGACCTAGAGTGCGTGGAATTCTCCCCGCTGGAATTG 
ACCGAAGAGGTCGTGCAGTCGTTCACCGGTGCCGCGCAGGCCAAGGGGCTGCAGTTGTATACCTGCCTCTCTGCGG 
AGCTGCCGCTGCGCATGCGGGGGGCCGCGGCGTCGATCCGGCAGATTCTCAACAACCTGCTGAGCAACGCGGTGAA 
GTTCACCGACAATGGCTATGTCAACGTCCACCTGAAGGCCAGCGTGGTCGATGCCGAATGTGTGATGCTGACCTGG 
CAGGTCAACGATACCGGCATGGGGATCAACGTCGAGGATCAGCCGCGTCTGTTCGAACCGTTCTACCAGATACGCC 
GCTCCGAGCATCCGGTCGCAGGCACGGGCCTCGGCTTGTCGATCAGCCAGCGCCTGGCGCAGCTAATGAATGGCAG 
TCTGAAACTGGTCAGTGAGCTGGGGTTGGGCAGCAGCTTTAGCCTCAGGCTTCCGCTTGAGCGGATCGCGATGCAG 
GCTGAGCCGCAGGACCTAGCCGGGTGCGCCGTCCAAGTGCTGGCGCCTGTCCGCGACCTAACGGAATGCCTGTGTG 
GCTGGATCTCCCGCTGGGGTGGAAGGGCCATGGTCGCGACGCCGAGGTCGCTGGACGAGGCGGACGCGACCTCGCT 
GCTGGTCAAAGTGTTACTGCTGGAGGGGGCGCCGATGTTCGAAGCATGGCCAGGATGCCGGGTGGAGCTTTCCCCT 
CAGGGTGATATGGAGCCGCAGGCACAGGGCCGCGACTGGCTGCTCGGGCTCAACAACCTGAACGGCCTGCATCGTG 
CTCTGGGCCTGGCCCATGGGCGTCTCGCTGATCCTTCGACGCCGCCGATACGGCTGGCTCCGTTGCGCAATCTAGG 
TCTCCGCGTCCTAGTGGTGGAGGATAACGCGATCAACCAGTTGATCTTGAGGGACCAGATGGAAGCGCTGGGCTGC 
AGCGTGGAGCTGCTCTTCGATGGTCGCGAGGCGTTGCTGCACTGCCAGACGGCCTGCTTCGACGTGGTGCTCACCG 
ATATCAACATGCCGAACATGAACGGATACGAGCTAACCGCGGAGCTACGGCGCCAAGGGTTCCGGCAGCCGATCAT 
CGGCGCGACGGTGAACGCCATGCGTGAGGAGCGCGAGCGCTGCATGTCCGCCGGGATGAACGATTGCCTGGTCAAA 
CCGGTGGATCTGAATGCCCTTCAGAACTGCTTGATTAATATTCTCAAGGTGGATCGATGA 



Protein sequence: (SEQ ID NO: 139) 

MKLKNFLQPFDSGFSTPS7^LKLLRMLGGALMLCVLCSLIFSVS^WLNHQVSLSRQAMNVAMYEAQLYFEQRE 

NHLSGNWPLAAGRALVNEAPNNVSILPLSDGGRGLLLTARTLGDLREKRLALMYLVDTDKGPLVYRLTADGRPSA 

AISSTITKEVYRALLATPSAPVHWVTDGGTPQRLYLFESLGDEPGEGWLGLEILGEDLDSMLRRNDAGNYMLLDQH 

GQ VVLATDAEALGSGASRTLLRGDGFGFIGAGPLPQHMVLFQHVGSSSWDLIYHIGIGRLLLALWLPLLLASAIiAL 

AVGILLHWLVRSIERRLIEPAKRRLEALKESEAFSRAVIQAAPVALCVLRRADAAWLENPQARQWLGDSEAIAHD 

APRWISQAFAGGVKCSGEELETEAGLHLHLNYTPTRYNGEDVLFCAFSEISARKRMEAELARAKSLADAANEAKTL 

FLATMSHEIRTPLYGMLGTLELLGRTELSRQQAGYLKAIQHSSSTLLQLISDVLDVSKIEAGQLDLECVEFSPLEL 

TEEWQSFTGAAQAKGLQLYTCLSAELPLRMRGAAASIRQILNNLLSNAVKFTDNGYVNVHLKASWDAECVMLTW 

QVNDTGMGINVEDQPRLFEPFYQIRRSEHPVAGTGLGLSISQRLAQLIWGSLKLVSELGLGSSFSLRLPLERIAMQ 

AEPQDIJVGCAVQVLAPVRDLTECLCGWISRWGGRAMVATPRSLDEADATSLLVKVIiliLEGAPMFEAWPGCRVELSP 

QGDMEPQAQGRDWLLGLNNLNGLHRALGLAHGRLADPSTPPIRLAPLRNLGLRVLWEDNAINQLILRDQMEALGC 

SVELLFDGREALLHCQTACFDWLTDINMPNMNGYELTAELRRQGFRQPIIGATVNAMREERERCMSAGMNDCIiVK 

PVDLNALQNCLINILKVDR . 
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FIG. 16 

RL037 

DNA sequence: (SEQ XD NO: 16) 

ATGAGCTGGAAATCCTATCGGGTGCTGGTGGTCGAAGATCAGCCGTTTCAGCGCGAATACCTG 
CTCAACCTGTTTCGCGAGCGCGGCGTGCAGTACCTGGTAGGTGCCGGCGACGGCGCGGAGGCG 
TTGCGCTGCCTGAAGCAGGACAGGTTCGACCTGATCCTCAGCGATCTGATGATGCCGGGC7VTG 
GATGGTATCCAAATGATCCTGCAACTGCCGTATCTCAAGCATCGTCCGAAGCTGGCGCTGATG 
AGCTCCTCGTCGCAGCGGATGATGCTCAGTGCCAGCCGGGTCGCCCAGAGTCTCGGCTTGTCG 
GTAATCGACCTGTTGCCCAAGCCGACTCTGCCCAAGGCCATCGGCCAACTTCTGGAACACCTG 
GAAAGATGCCTCAGGCAGAAGCTGGAGCCGGAAACCGACGAGACTCCGCATGGGCGCACGGCG 
TTGCTGGATGCCCTGCATAACGAGCAACTGGTGACCTGGTTCCAGGCTAAGAAATCCCTCCAC 
ACCGGGCGCATAGTCGGCGCCGAGGCGTTGATACGCTGGAGCCACCCGCAGCATGGCCTGTTG 
CTGCCCAGCTGTTTCATGAGTGATGTCGACGCTACCGGTCTGCACGAGGCGTTGCTCTGGCGC 
GTGCTCGAACAGACCCTGAACGCCCAGGAATCGTGGCGCAGGGCGGGTTACGAGATTCCGGTT 
TCGGTGAATCTGCCGCCGCACCTGCTCGATAACCAGGAACTTCCGGATCGACTCTATGAGTAC 
GTCGGCGCTCGCGGGGCTTGTACCAGCTCACTATGTTTCGAGTTGACCGAGAGCAGTGTCACA 
ACTCTGTCAAGTAACTACTATGCAGGTGCCTGTCGCTTGCGCATGAAAGGGTTCGGATTGGCC 
CAGGACGACTTTGGCCAGGGTTACAGCTCGTTCTATAACCTGGTCACGACGCCTTTCACGGAG 
CTGAAGATCGACCGCTCCCTAGTCCAGGGATGCGTAGAGGATAACGGCCTCAATGCAGCTGTC 
ATCAGTTGTATTGAGTTGGGTCACCGCCTGAATCTCGACGTGGTGGCCGAAGGCGTGGAGACC 
TGCGAGGAACtGAATCTTCTTCGTCGTCTTGGCTGCGACCGGGCGCAGGGTTTCCTGATTTCT 
AAGGCAGTGTCTGCTCGTGAGTTCGAGCGGCAGTTAAGGGAGGACGGCCCCAGCCTCCTTGTT 
TAA 



Protein sequence: (SEQ ID NO: 140) 

MSWKSYRVLWEDQPFQREYLLNLFRERGVQYLVGAGDGAEALRCLKQDRFDLILSDLMMPGM 
DGIQMILQLPYLKHRPKLALMSSSSQRMMLSASRVAQSLGLSVIDLLPKPTLPKAIGQLLEHL 
ERCLRQKLEPETDlSTPHGRTALLDALHNEQLVTWFQAKKSLHTGRIVGAEALIRWSHPQHGLL 
LPSCFMSDVDATGLHEALLWRVLEQTLNAQESWRRAGYEIPVSVNLPPHLLDNQELPDRLYEY 
VGARGACTSSLCFELTESSVTTLSSNYYAGACRLRMKGFGLAQDDFGQGYSSFYNLVTTPFTE 
LKI DRSLViQGCVEDNGLNAAVI S C I ELGHRLNLDWAEGVETCEELNLLRRLGCDRAQGFL I S 
KAVSAREFERQLREDGPSLLV . 
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FIG. 17 



RL038 

DNA sequence: (SEQ ID NO: 17) 

GTGAAGTCTGCTAGTGCCTTGGAGCACCSACAACy^AACTTTTGCTCAAATGGACAACCCTCTCGCAG 
ATCTGTGTGGTGGTGCTGACOSTATTGCTGTTCAGCATCTGTTACTGGTCGCTGGGGAGATTGTTTC^ 

GTCTCCTTCCACTTCACCCGTATGATGGATGTTATACGGGAGCATGAGGTATTTCTTGGGCGCATCGCTCGAAAAAGCGAC^ 

accacccagaagtacgactatgatotggtgcctttgcagcggcacttgttggcaaaggaa;^ 

CGGGAGTTTTCCTTTGCTATGCCATTTCTACTGGCTACCAAGa^CGCGTTGAGCGCCGATTCCTCGGGAGATCCGTT^ 

ggtgtattgctcgccaatttctacggaagcttctggagtgtttccgcctatcccgcgccacagttactgatctttgatctttcc 

GGCAGCACCCGCCTGGCAGTGCCGTCGATTCCCTCCTVCAGCGCAGCGTGACAGGTTGAGCGGAAGCTATCCGAT^ 

cgcattctggcgcgcttgcgcacccggccggtgggggaggacgctcagcgtgtccattggatacgcgctgatcgctatcgcgac 

TCGGCGCTGGAGATGTTGGGAGTCGCCCGGGTTGATCTGCCGGAAACACTCTGGTGGCACGACGAGCCGAACCATCTGATCATC 

GCTGCGAGCCTGCTTGATCTCAGGCGAATCy^TGACTTCGAACAGTTGGTTGAGCGCCCGGCATTCGATTCGTACAGC 

TCGCaXSATGGCGAGGTATTGCTCGGOSCGGCCCCTGCGACCGGCCTGAGGGATGGCCTGAACCTCACCCGAC^ 

GTTCT^CTGCTCAGCCAGCCTGAGAACGGTTGGCTCGCGGTCTACCGAACCGACTACGGCAATTTCTTTCGCCA 

CTGGTGGCa^GGTCTGCTGCTGACCCCGGCGCTGCTCCTGGCCGGTTGGCTCGGGATGCGTTGGTACACCAGCAGCGTCGTC^ 

CCGGTGCATCGGGCGCACCGGCAACTGGTGGAGAGCGACACCTTCAGCCGGACGCTGATACAGACCGCGCCGGTG^ 

GTGCTGACCCAGGATGACCAGCAACTGGTGACCTGCAACCACTTGGCCGCCCAGTGGCTGGGCGGGCCCACGGAGATC 

CTGACTTCCAACTGGAAGCTTTTCGATGCGCGTGGGCAGGTACCAGGAGACATCTGTATCCAGGTCGGTGGGCGCT^ 

ACCGCCTTCGCGGCGACCCGCTATGCCXSGCACCGAGGCGGTACTGTGCGTATTCT^CGACATCACG 

ACCGCGCTGTCCAATGCGAA6CGAGCAGCGGATGCCGCCAGCCAGGCCAAGACCCTGTTCCTGGCCCGCATGAGCCATGAAATC 

OTTACTCCCCTGTACGGTGTCCTTGGCACCCTGGAGTTGCTCGACCTGACCACCCTGAACGAGCGGCAACGCGCCTACCTAC 

ACCATCCAGAGTTCGTCTGCGACGCTCATGCAACTGATTAGCGATGTGCTGGATGTCTCGAAGATCGAAGCGGGGCAGATGGCT 

CTGACCCTGGCCGCCTTCAATCCGCTGGACCTAGTGCGGGAAGTGCTTGGCAACTTTGCCGCCAGCGCCATGGCCAA 

CAGTTCTATGCCTGCATCGACACCGAAGTGCCGGCGCAACTGATCGGTGACGTGACGCGGATTCGCCAGGTGCTCAAT^ 

GTGAATAACGCGCTGAAGTTCACCGATATCGGACGGGTGGTCCTGCGCGTGAAGTTGCTCTCCCGCAATGATGGTCGAGCCCTG 

TTGCAGTGGCAGGTCGCCGACACCGGTATCGGTATCGCACACGAACAGCAGGAGCGCTTC^ 

GGAGCGCACCATGCCGGCGGCACGGGGCTAGGACTGTCGATCTGCTGGCATCTGGCGGAAATGATGGGCGGTCACCTGCG^ 

GTCAGCGAGACAGGGCTCGGCAGCAGCTTCAGCCTGGTGCTCGAGTTGCCCGAGGACGAACAGTCCGGGCTGGCTTGCCG^ 

GGGCTCTTGAAATCCGCTTGCGTCCATGTGCGCTCGCCCGTGCGGGAGCTAGCCGACAGCGTAGGGGCGTGGCTGAAAGCCTGG 

GGCTGCAAGGTCAGCAGCGGCGAGGCGGCGCCCTCCGAGCTGGAGACTTGTGTGCTTCTGGAGCTGCTGCCC^ 

CCTGCTTCTTCGCCCTGGCCAGGCCCCCGGGTGCGCGCGTCCATGGATGCGCCTTGCCAGCCGGAGCTGCGTGAGGACGGCTGG 

CXSTGTCGGCCTGCAci^CCTGGCGGGAATCGGCCAGGCCCTGGCGCAGGCTCTGGGTGGCGATATCCCCGAGC^^ 

AATGCCTGCGCCCGCTCGGGGAGACTCGACCTGGAAGTGCTGGTCGCCGAGGACAACCCAGTCAACCAGGCGCTGCTTCGCGAG 

CAACTGGAAGAGCTGGGTTGTCGCGTGAGCCTTGCCGGCGATGGGCGGCAGGCCCTGCAGCTGTTCGACAGTGGTCGCTTCGAC 

CTCCTGCTCAGCGACGTCAACATGCCGAACy^TGACCGGCTACGAACTGACCCAGGCGCTGCGCGAACGAGGCGAGACGCTGC^ 

ATCATCGGCGTGACCGCCAACGCCCTGCGAGAAGAGGGCGAGCGCTGCCGGGCAGTGGGAATGAACAGTTGGCTGGTGAAGCCG 

ATCACTCTGCATACCTTGCATGAACTGCTCAGTGAGTTCGCTCGCGCAGGTGTCGTGCTTCCCGCGCAAGCGCGAGACCTCGGC 

CCGCCCGCGCAGCTCGACGACGGTCTCTCACCGCAGGTGCCGGAACGCATGCGCGCGCTTTTCCTTGAGACCATGGGCAA 

CTGGAGGCCGCCCGGCAAGCGATTCGCCGCAACGACCCGAAGGGGCTGCAGCAGGACCTGCATCGCATGGCC 

GTGATGCGTGCGCGAACGCTGGTGGTGATGTGTCAGGGCGCCGAGGAAGGCCTGCTGGAGTCGCGCCTTGAATGTTCCGCCGTG 

GAGATTGGCGAGGTGCTCGTTCATATCGAGCAGGCGCTGGAGTTTGTGAGAAAGACGGGCTGA 

Protein sequence: (SEQ ID NO: 141) 

VKSASi^EHpNKLLLKWTTLSQSLSIGLIC\nAn^TVLLFSICYWSLGRLFQEEEDKVS 

TTQKYDYDVVPLQimLLAKENGIjAVYEGREFSFAMPFLLATKHALSADSSGbPFSLGVLL;^ FWSVSAYPAPQLL I FDLS 

GSTRIiAVPSIPSTAQRDRLSGSYPMIVERILARLRTRPVGEDAQRVHWIRADRYRDSALEMLGVARVDLPETLWWHDEPNHLII 

AASLLDLRRiNDFEQLVERPAFDSYSLVSPDGEVLLGAAPATGLRDGLNLTRQGVAVQLLSQPENGWIiAVYRTDYGNFFRHSRW 

LVAGLLLTPALLl^GWLGMRWYTSSVVNPVHRAHRQLVESDTFSRTLIQTAPVALVVLTQDDQQLVTCNHI^ 

LTSNWKLFDARGQVPGDICIQVGGRYLQTAFAATRYAGTEAVLCVFNDITVHCEAETALSNAKRAADAASQAKTLF^ 

RTPLYG\n^TLELLDLTTLNERQRAYLRTIQSSSATLMQLISDVLDVSKIEAGQMALTtJ^FNP 

QFYACIDTEVPAQLI GDVTR I RQVLNNLVNNALKFTDI GRWLRVKLLSRNDGRALLQWQVADTGI GI AHEQQERL FEAF YQVS 

GAHHAGGTGLGLSICWHLAEMMGGHLRIWSETGLGSSFSLVLELPEDEQSGIJVCRPGLLKSACVHVRSPVREI^ 

GCKVSSGEAAPSELETCVLLELLPMAAGPASSPWPGPRVRASMDAPCQPELREDGWRVGLHNIiAGIGQALAQALGGDIPEQTPA 

NACARSGRLDLEVLVAEDNPVNQALLREQLEELGCRVSLAGDGRQALQLFDSGRFDLLLSDVNMPNMTGYELTQALRERGETLP 

IIGVTANALREEGERCRAVG^mSW^JVKPITLHTLHELLSEFARAGVVLPAQARDLGPPAQLDDGLSPQVPERMRALFLETMGKD 

LEAARQAIRRNDPKGLQQDLHRMAGSLAVMRARTLVVMCCK3AEEGLLESRLBCSAVEIGEVLVHIE 
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FIG. 18 

RL039 

DNA sequence: (SEQ ID NO: 18) 

ATGCGTCCGGGGTCAATAGTTGGAATTAGAACACAAGAGAAGCCTATGAGTAAGCTCAAGATA 
GTACTGGCCGATGACCATCCGATCGTGCGTATGGGCGTATGCGACATGCTCGAGCGCGACGGT 
CGGTTCGAGGTGGTGGGCGAGGCCTCCACGCCCAGCGAACTGGTCGAGGTGTGCCGGCAGAGC 
GAGCCGCATATCGCCATTACCGACTACAGCATGCCCGGGGACGAGCGCTACGGCGATGGACTG 
AAACTGATCGACTACCTGTTGCGCAACTTTCCTCGTACTAAGGTGCTCATCTTCACCATGGTC 
GGCAACCGCCTGATCCTCGACAGCCTCTACGATCACGGGGTGTCCGGCGTGGTGCTGAAGAGC 
GGCGAACTCGACGAGCTGCTCTTGGCGCTCGACGTGGTGAAGCAGAACCGCGTCTACCGGGGC 
GCGAACATGCTCGACCCGACCAGTGTTCTGGCGAACCGCGACGAAGTGGAAAGCCGCTTCGCG 
CGCTTGTCGATGAAGGAGTTCGAAGTACTCCGTCACTTCGTTTCCGGCAGCAACGTCTGCGAT 
ATCGCACGGCTGCTGAAACGTAGCGTCAAGACCGTAAGCACGCAGAAGGTCTCGGCGATGCGC 
AAGCTGGAAGTGAACAGCGACCAGGCCTTGATGACCTTCTGCGTGCATGCCAACTTGTTCCAT 
TGA 

Protein sequence: (SEQ ID NO: 142) 

MRPGSIVGIRTQEKPMSKLKIVLADDHPIVRMGVCDMLERDGRFEWGEASTPSELVEVCRQS 
EPHIAITDYSMPGDERYGDGLKLIDYLLRNFPRTKVLIFTMVGNRLILDSLYDHGVSGWLKS 
GELDELLIALDWKQNRVYRGANMLDPTSVLANRDEVESRFARLSMKEFEVLRHFVSG 
lARLLKRSVkTVSTQKVSAMRKLEVNSDQALMTFCVHANLFH. 
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FIG. 19 

RL040 

DNA sequence: (SEQ ID NO: 19) 

GTGTCCAGTAAGATCCTGCTGCAAGGGGCACTGCTCGGCCTAGCAATGCTGGCCGTGCTGGAC 
GCCCGAGCCGGAGTCACCGCCGAGCGCACTCGGGCAATAATCGCCGAGGGGCACCGCGAGACG 
TCGCTGCTGCTGGTCAACCAGAATGCCTATCCGGTCATAGTGCAGACCTGGATCGACGATGGC 
GCCCCGAACTCGACACCGCAGTCTGCCCGCGCGCCGATCATGCCGCTACCGCCGGTGTTCCGC 
CTCGAACCCGGACAGCAACGCAGCCTGCGCCTGCTGCGGACCGGCCAGGCGCTGCCAGGGGAC 
CGCGAATCGCTGTACTGGTTGAACCTCTACGAAATCCCGCCGCAAGCCACCGGGCTGCTGGCC 
GAAGGACAGTCACGGCTGACCGTTACACTGCGCACCCAGATGAAAGTCATCTACCGCCCTCGC 
CCTCTGGCCAGAGGTGCGGAAGAAGCGCCACACCAGCTCAGGTTCGAGCGGCGGGGCGAAACA 
CTACAGATGGAGAACCCTACTCCCTATTTCATCAGCCTCGCCGGCGCCGAGCTTGGCGGCCAC 
ACCCGCCTGGCGGCGGCCGAACTGTTGCCCCCCTTCTCCAGGCGCGTCCTGGCGCTCCGCCAG 
GCGCTGCCCGGCGGCCAGGCCGAGGTGCGCTTCAGCTGGATCGATGACGGCGGCAATCTCCAG 
CAGGGACGGAGCCTGCTTCACTGA 

Protein sequence: (SEQ ID NO: 143) 

VSSKI LLQGALLGIJQJVILAVLDARAGVTAERTRAI lAEGHRETSLLLVNQNAYPVI VQTWIDDG 
APNSTPQSARAPIMPLPPVFRLEPGQQRSLRLLRTGQALPGDRESLYWLNLYEIPPQATGLLA 
EGQSRLTVTLRTQMKVIYRPRPLARGAEEAPHQLRFERRGETLQMENPTPYFISLAGAELGGH 
TRLAAAELLPPFSRRVLALRQALPGGQAEVRFSWIDDGGNLQQGRSLLH 
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FIG. 20 

RL041 

DNA sequence: (SEQ ID NO: 20) 

ATGAAAACATCCCTGCGCGTCCTGCCTCTGCTCCTCGCGCTGCTCGCCTCGTCTAGCTGGGCG 
ACCTGCTACAAGGTCACGGCGGTAGGCAACGCCACGACTACCTCCAACACCCAGATACGTCCC 
GGTGAAGGCTCTGCCGGCACCTGGGCCGGAGCCTGCGATACCTGCAACGGTTCCCTCGGTCTA 
CCGAGCGTGATCAACGTCAGCGACGCCAGCTTCCAGCCCGACGGTAGCTTGATCGCCAGCTCG 
GTGGCGCCGCTCAGCCAATACGGCGACAGCGCCGGCTACGACCCAGAGCGCGTGTTCTTCCGC 
TGTGCTCCAGAGGACGATGTGTACGAGATGTTCTCCACCAATGCCGACGATCTCTACAGCGGC 
TGGTACCTAGGAGGCGACAGTGCGGGCAACTCGATTGGCCTGCAGTCCGCCTATCGCACCGCC 
TGGCCCAACGTGCTGCTGCGCCTAACCCACGTGGAAACCGGGCAGTATTTCACCGATGTCTGG 
CGCGAGCGTCTGCTCGGCGGGCTCGATATCGACTCGCGAGGCTTTCAACTGGTCAAGGCGAAG 
AACCTCAGCGCGGTACGCGCCGAACTGTTCCGCGCTCCGCTGGAGTTCATCCGCTACTACTCG 
CCGACTACCGCCTCGCGGTTGTACGCCTACACCCAGCCCGCTGGCTACATCGCCATCAAGGGT 
CCCGGCCTGGCCTACCCCAACGTCGGCGCCAGCCATAACGCCAACTACCTCGGCTGGCACTAC 
AACTGGCCGGGCGCCATCGGCCTGTACAACGACGTGACGCTCAAGCGCTATCCCACCTGTTCC 
GTAACCAACGTCACGCCCCACGTTGTGTTCCCGTCGATTTCCCTCAGTGAGATTAATGCCGGC 
GCGAACCGTGAGATGCCCTTCGAGGTGGCCTTCAAGTGCCAAACGGGAGTGATCAACAGCACC 
GCCTCCAGCGGTACTGCACTGGGTATCAGGGCTTCAGCCGGGGCGCAGGCCGCGTCCGCTGCA 
CTGGGCCTGAGGAACGCCAATGGCGGGCTCTCCTACCTAGTTTCCGACCGCTACGGCCAGCCT 
GGTATGGCCCAAGGCGTGGGTATCCGCTTGCTGCGCGACGGCAGTGCGATGAACCTGCTGGTA 
AGCGAGGATTCCGCGATGGGCAGCAATGCCGAAACACGGGGCTGGTATCCAGTGATCGGCAAC 
GCCTCGAACAAGACTGGCGAAGCGGGAGGCATCAGCCAGTACAGCGAGACCTTCCGTGCGCGC 
CTGGAAAAACTCACCGTTGGCAGCATGCCCAGCGTTACCCCGGGACGGGTGGAGGCCAGCGCG 
CAGGTAGTGATTCGTGTCCAGTAA 

Protein sequence: (SEQ ID NO: 144) 

MKTSLRVLPLLLALLASSSWATCYKVTAVGNATTTSNTQIRPGEGSAGTWAGACDTCNGSLGL 
PSVINVSDASFQPDGSLIASSVAPLSQYGDSAGYDPERVFFRCAPEDDVYEMFSTNADDLYSG 
WYLGGDSAGNSIGLQSAYRTAWPNVLLRLTHVETGQYFTDVWRERLLGGLDIDSRGFQLVKAK 
NLS AVRAELFRAPLEF I RYYS PTTASRLYAYTQPAGY I AI KGPGLAYPNVGASHNANYLGWHY 
NWPGAIGLYNDVTLKRYPTCSVTNVTPHVVFPSISLSEINAGANREMPFEVAFKCQTGVINST 
ASSGTALGIRASAGAQAASAALGLRNANGGLSYLVSDRYGQPGMAQGVGIRLLRDGSAMNLLV 
SEDSAMGSNAETRGWYPVIGNASNKTGEAGGISQYSETFRARLEKLTVGSMPSVTPGRVEASA 
QWIRVQ 
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FIG. 21 

RL042 

DNA sequence: (SEQ ID NO: 21) 

ATGTTCTGCCACGTTGAGGCACGGCGCACCGGCAAACTGCCGCTGGCTCTGGGCGGCCTAGCGCTGGCCTTCGCGG 
GCCTGGCCAATGGAGAGGCGCAGTATCGTTTCGACGACAGCCTGTTGATGGGCTCGGGCCTCGCCGGCGGGACCCT 
AGAACGCTTCAACCGGGCCAACCAGGTGGACCCCGGAACCTACCATGTCGATGTCTATCTCAACGGCAGCTACGCC 
AGTCGCACCAGGATCGAGTTCCGCCCCCGGGCCGGCGGCGTCAAACCCTGCTTCGGCGAACGCTTCTTGCGCCGGA 
CGCTGGGCGTCCGCCCCGCCTCTGAGGCCGGCGTGCAAGCGCCTGGAGATTGCCTGGGGCTGGAAGAACGCCTGCC 
AGGCTCGACCTTCAATCTCGACACCGCCCTTCTGCGCCTCGATCTCTCGGTGCCCCAGGCCCTGCTGGATATCAAG 
CCACGCGGCTACGTGGGTCCCGACGAGTGGGACGCTGGCAGTAGCATGGGCTTCGTCAACTACGACGCCAGCTTCT 
ATCGCTCGAGCTTCGACGGAGTAGGCGGCAACGGCGACTCGGACTATGGCTACCTGGGGCTGAGCGGGGGCATCAA 
TTTCGGCCTGTGGCGCCTGCGCCACCAGTCCAACTACAGCTACTCCAGCTATGCGGGAAACACCCGCAGCGACTGG 
AACAGCATCCGCACCTATGCCCAGCGCGCGGTGCCAGGCCTGCGCAGCGAACTGACCCTGGGCGAGAGCTTCACCG 
AGGGCAATCTGTTCGGCAGCCTGGGTTATCGCGGCGTGCGCCTGGCCAGCGACGACCGCATGCTGGCAGACTCGCA 
ACGCCGCTATGCTCCACAGGTACGCGGTACAGCGAACAGCAACGCACGGGTGGTCATCAGCCAGAACGGCAAGAAG 
GTCCACGAATCCGCCGTCGCTCCCGGTCCCTTCGTCATCAACGACCTCTATGGCACCGCCTACGACGGCGATCTGG 
ATGTCCAAGTGATTGAGGCCGACGGCAGCGTCTCGCGCTTTTCCGTGCCCTTTTCCGCGGTTCCCGAATCCATGCG 
CCCGGGCATCTCGCGCTACAGCGCCACCCTCGGCCAAGCGCGCCAGTATGGCGACGGCAACGACCTGTTCGGCGAC 
TTCACCTATCAGCGCGGCCTGACCAACTCGCTAACCGCCAACCTCGGCTCGCGCCTGGCCGAGGACTATCTGGCGC 
TGCTCGGCGGAGGCGTGCTCGCCACGCCCTACGGAGCCTTCGGCTTCAACAGCATCTTTTCCCATGCCACGGTGGA 
GAACGGCCAGCGCAAGCAGGGCTGGCGTGTCGGTCTGAACTACAGCCGGACCTTCCAGCCGACCCAGACCACTCTC 
ACCCTGGCTGGCTACCGCTATTCCACCGAGGGCTATCGCGACCTCGGCGACGCGCTTTCGGCGCGCCACGCCGATG 
AGCACAACGACTCCTGGAACTCCAGCAGCTACAAGCAACGCAACCAGTTCACCCTGCTGGTCAACCAGGGCCTGGG 
GGGCTACGGCAACCTGTATCTGTCCGGAGCCACCAGCGACTACTACGACGGCAAGAGCCGCGACACCCAGTTGCAG 
TTCGGCTACAGCAACACTTGGCGCCAGCTCAGCTACAACCTCGCCTATTCGCGCCAGCAGACCACCTGGTACCGCG 
ATCTGAACGACGACTACGACCCGTCACTGCCGCCGCAATACAACCTGCGGCACGGCAGCGAACGTAGCAACACCTT 
AACCCTGACACTTTCCATGCCGCTGGGGTCCTCCAGCCAGGCCCCGAATCTCAGCGCGATGGCCTCCCGGCGTTCC 
GGCGACAGCCGCGGCAGCAGCTACCAGACGGGCCTCAACGGCACCCTCGACGAAGACCGCAGCCTGAGCTACGCGA 
TTGCCGCCGGGCGCGACAGCGACAACCACGGCAGCGATTTCAACGGCAGTCTGCAGAAACAGACCTCGGTGGCGAC 
GCTGAACGCCGGCTATGCCGAGAACAGCAGCTACCGGCAGCTCAACACCGGCCTGCGCGGCGCCGCCGTGCTGCAT 
CGCGGCGGCCTGACCCTCGGCCCCTACGTCGGCGACACTTTCGCCCTGGTCGAGGCCAAGGGCGCCAGCGGAGCTG 
GCGTACGCGGTGGTCAGGGCGCGCGCGTCAACGGCAATGGCTACGCCGTGGTGCCATCACTCTCGCCCTACCGCTA 
CAACCCGGTCAGCCTCGATCCGCAGGGCATGGGCGAAGAGGCCGAGCTGCTGGAGACCGAGCGCAAGATCGCGCCA 
TACGCCGGCGCCGCCGTGCATGTGAAGTTCCGCACACTGACCGGTCACCCATTGCTAATCCAGGCCCAACTCGCCG 
ACGGCAGCGCGCTACCGCTAGGGGCCAATGTGCTCGACAGCCAGGGTGTGAACATCGGCATGGTCGGTCAAGGCGG 
CCAGGTCTATGCCCGCGCCGAGGGCGACAAGGGCCGCCTGCGCGTGCAATGGAGCGAACGCCCAGGGGACGCCTGT 
CTGCTGGATTACGACCTCGACACTGGCCCTCGCCAGGCTATCGAACCCGGACAGGCGGTGATCCGCCTGCAGGGCA 
CCTGCACGCCCGTCTCGGAGGCACCATGA 

Protein sequence: (SEQ ID NO: 145) 

MFCHVEARRTGKLPLALGGLALAFAGLANGEAQYRFDDSLLMGSGLAGGTLERFNRi^ 

SRTRIEFRPRAGGVKPCFGERFLRRTLGVRPASEAGVQAPGDCLGLEERLPGSTFNLDTALLRLDLSVPQALLDIK 
PRGYVG PDEWDAGS SMGF VN YDAS F YRS S FDGVGGNGDSDYGYLGLSGGINFGLWRLRHQSNYS YS S YAGNTRSDW 
NSIRTYAQRAVPGLRSELTLGESFTEGNLFGSLGYRGVRLASDDRMLADSQRRYAPQVRGTANSNARVVISQNGKK 
VHESAVAPGPFVINDLYGTAYDGDLDVQVIEADGSVSRFSVPFSAVPESMRPGISRYSATLGQARQYGDGNDLFGD 
FTYQRGLTNSLTANLGSRLAEDYLALLGGGVLATPYGAFGFNSIFSHATVENGQRKQGWRVGLNYSRTFQPTQTTL 
TLAGYRYSTEGYRDLGDALSARHADEHNDSWNSSSYKQRNQFTLLVNQGLGGYGNLYLSGATSDYYDGKSRDTQLQ 
FGYSNTWRQLSYNLAYSRQQTTWYRDLNDDYDPSLPPQYNLRHGSERSNTLTLTLSMPLGSSSQAPNLSAMASRRS 
GDSRGSSYQTGLNGTLDEDRSLSYAIAAGRDSDNHGSDFNGSLQKQTSVATLNAGYAENSSYRQLNTGLRGAAVLH 
RGGLTLGPYVGDTFALVEAKGASGAGVRGGQGARVNGNGYAWPSLSPYRYNPVSLDPQGMGEEAELLETERKIAP 
YAGAAVHVKFRTLTGHPLLIQAQIJUDGSALPLGANVLDSQGWIGMVGQGGQVYARAEGDKGRLRVQWSERPGDAC 
LLDYDLDTGPRQAIEPGQAVIRLQGTCTPVSEAP 
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FIG. 22 

RL043 

DNA sequence: (SEQ ID NO: 22) 

ATGAATACTTTTCCACTGCCTCCGCTCCGTGCGGCTACGCTGGCGCTCGCCCTGCTGATACCC 
GCCATCCCGGCTCAAAGCAGCGTGGTGATCATCGGTACTCGCGTGATTTATCCCGGCGACGCC 
CGGGAAAAGACCGTGCAGATGATCAATCAGGACGCATTCCCCAACGTGATCCAAGCCTGGATC 
GACAACGACGACCCCTCCTCCACCCCGGAGACTGCAAACGCGCCCTTTCTGGTCAGCCCAGCG 
GTGACGCGCATAGCCCCCGGCAGCGGCCAGACCCTGCGCCTCCTGTATACCGGGCTCCCGCTG 
CCCGAGGATCGCGAATCGTTGTTCCATCTCAATGTGCTGCAGATCCCGCCCCGCGACCTGGCC 
AAGGCCGAGCGCAACCAGATGCTGCTGATGCAGCGCAGTCGACTGAAGCTGTTCTATCGCCCC 
GCCGCGCTGCTTGGCGGCTCGGAGCAGCTAGTCGAGCAGTTGCACTTCAGCCTGGTGCAGGCG 
AGCGGCAACTGGCGTGTGCGGGTGGACAACCCCAGCGGCTACTACGCCTCCTTCGCCGGCGCG 
ATGCTGAGCATCGGCGAACGTCGCTGGCGGCTGCTGTCGAGCATGGTCCCGCCCAAAGGCCAG 
GCCGAGTGGGCGGCGGAACGCCCTTCGCCGCTCGCCCCAGGACCGGTCCAGTTGAACGCCCTC 
TTGATCAATGACTACGGCGCGCGAATGGAGGTCCAGCATGTTCTGCCACGTTGA 

Protein sequence: (SEQ ID NO: 146) 

MNTFPLPPLRAATLAIJVLLIPAIPAQSSWIIGTRVIYPGDAREKTVQMINQDAFPNVIQAWI 
DNDDPSSTPETANAPFLVSPAVTRIAPGSGQTLRLLYTGLPLPEDRESLFHLNVLQIPPRDLA 
KAERNQMLLMQRSRLKLFYRPAALLGGSEQLVEQLHFSLVQASGNWRVRVDNPSGYYASFAGA 
MLSIGERRWRLLSSIWPPKGQAEWAAERPSPLAPGPVQLNALLINDYGARMEVQHVLPR. 
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FIG. 23 

RL044 

DNA sec[uezice: (SEQ ID NO: 23) 

ATGAAACCTCAAAGTACTGCCCTGACTATCGCCGCATTTCTCGCATTGCCGGGTATCGCGGCG 
GCTGCCAATACCATCACCTTCCACGGAGAAGTGACCGACCAGACCTGTTCCGCCGTCGTCGAC 
GGACGAACCGACCCGACCGTGATACTCGACACCGTACCGGTAAGCGCTCTTGACGGCGCAGTC 
GGCAAACCCGCCGGGGAAACCAGCTTCACCCTGCAACTGACCGGTTGCGCCGCTCCGGCGGCC 
GATGCCGAGGAGCACTTCAGCGTGATGTTCCAGGCGGTCAATCCGACCAGCGCCGGCAATCTG 
ACCAATACCGCGTCCGCCGGCGCCACCGGCGTAGCGCTGCAGCTACTGACGGCACCGGGCGGC 
AGCGAGGTCAATCTGGCCGGCGGGTCGGCCGTGGCTGCCGGTGACATCGTGCTCGCAGGAGGC 
GAGACCAGCACCAGCTACGACTATGCCGTCCGCTACATCTCCGAAGCGACCACCGTCACTCCG 
GGACCGGTGCTCGGCTCGGTGACCTACACCCTGCGTTACGAGTAA 

Protein sequence: (SEQ ID NO: 147) 

MKPQSTALTIAAFLALPGIAAAANTITFHGEVTDQTCSAWDGRTDPTVILDTVPVSALDGAV 
GKPAGETSFTLQLTGCAAPAADAEEHFSVMFQAVNPTSAGNLTNTASAGATGVALQLLTAPGG 
S EVNLAGGSAVAAGD I VLAGGETSTS YDYAVRY I S EATTVTPGPVLGS VTYTLRYE 
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FIG. 24 

RL045 

DNA sec[uence: (SEQ ID NO: 24) 

AGTCCGCACGGTAGTGACGACTGGAAGCGCTTCTGTGCTGCCAACAACCTGGAGCCCAGCATG 
AGCCGGCGCGGCAATTGTTGGGATATGCCGTGGCGGAATCCTTCTTCAGTAGTTTGAAGAAAG 
AGCGTATCCGCAAACGCATCTACAAAACCCGAGACATGGCCCGGGCGGATGTTTTTGACTACA 
TCGAGGTCTTCTACACCCGAACCCGGCGGCACAGTCATCTGGGTGGCGTCAGTCCCGAGGCCT 
TTGAAAGCGCCTCG 



Applicants: Laurence Rahme et al 



41/118 



FIG. 25 

RL046 

DNA sequence: (SEQ ID NO: 25) 

ATGGCTGAAGTCACTCAACGAGCAGAGCAGCAACAAGAGAGCCAGAAGACCCTTCTCGGCACC 
ATCATCAGTACGCCCTTCCAATTTCTCGGCGTGATGTTCGGGTCGCTGATCGGCGCAATCATC 
GTGGAGTGGGTTTGCCTGTATTTCTTCTGGCCTGACGCGGGCTGGAAGCATGCCCAGGCCATG 
TTTGAGTACGAACTCAGTTGGCTGTCGCAGGGGCTGCTACACAGCGTCGTCGTGCAGGAGCCA 
GGTCGAACCGCCACCTGGCTGGCCCAGTTGGCCTATGACTGGTTGTTCGTGAAGACCGGGATG 
GTCGACTGGATGACCAACATGACTACCATCGCGCAGGCCGGGCCACGGAGCCCGCTGGACGTT 
CGCTATCTCACCGCCCAGGGTGTCTCCACGCTGCAGAACTACGGCCTGGCCGCGCTGTACACG 
GTGCTGACATTCGTCGTGCGCCTGGTGATCCTGGTCATGACGATCCCGTTATTCGTGATGGCC 
GCGTTCACCGGCCTGGTGGACGGCCTGGTGCGCCGGGACCTGCGCAAGTTCGGCGCCGGCCGG 
GAGTCCAGCTACCTCTACCACAAGGCGCGCGGCAGCATCATTCCGCTAGCGGTCGTCCCTTGG 
ACGCTCTACCTGGCAATTCCCATCAACATCAATCCCCTGCTCATCCTGTTGCCCTGCGCCGCA 
CTGCTCGGCGTAGCGGTGTGCATCACAGCATCCACCTTCAAAAAGTACCTATAG 

Protein sequence: (SEQ ID NO: 148) 

MAEVTQRAEQQQESQKTLLGTI ISTPFQFLGVMFGSLIGAI IVEWCLYFFWPDAGWKHAQAM 
FE YELS WLSQGLLHS VWQEPGRTATWLAQLAYDWLFVKTGMVDWMTNMTT I AQAGPRS PLDV 
RYLTAQGVSTLQNYGLAAL YTVLTF WRL VI LVMT I PLFVMAAFTGLVDGL VRRDLRKFGAGR 
ESSYLYHKARGS 1 1 PLAWPWTLYLAI PININPLLILLPCAALLGVAVCITASTFKKYL . 
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RL047 

DNA sequence: (SEQ ID NO: 2 6) 

ATGGCTGGCCAGTACCCGTTGGAAGCGCTCTTGCGGCCTGCCGTGGAGCTCTACACCACCACCGTGTGC 
TTCACCGCAGCCGCGCTCTGCATCGTCGCGCCGTGGACGTTCTCCCTCACTCCGCTGTTCGGCATCGTG 
GCCGCGCTGTGCTTCGCCTGGCTGGGTATCGTGCGGCTGAAGCAGGCCGGCGTGGTGCTCCGCTACCGG 
CGGAACATTCGCCGACTGCCGAAGTACACGATGACCAGCGCCGAGATGCCGGTCAGCAACGAACACCTG 
TTCATCGGTAAAGGATTTCGCTGGACGCAGAAGCATACGCAGCGCCTGGCAGATACCTACCTGCCCCAG 
TTCGCCTCTTACGTCGAGCCCTCGCCCCTCTACGAGCGCGCGCGCCGGTTGGAGAAGCAGCTCGAGTTC 
GCCCCCTTCCCCCTGAAGCTGGTCGCCAAAGCCACTGCCTGGGACGTGGCCTGGAACCCCGCACGGCCG 
CTGCCGCCCGTGGGCGGTTTGCCTCGGCTCCATGGCATCGAGCCGCGCGAACAGGACGTAGGCCTGCAA 
CTGGGCGAGCGCGTCGGCCACACACTGGTACTCGGCACCACGCGGGTGGGTAAGACGCGCCTCGCGGAG 
CTGTTCATCACCCAGGATATTCGCCGCACTCACTGCCGGGTACGACGCCGGCGGGTGAAGATGGGCCGG 
CGGACCCAGACGGTTCACCACGGCTATCGGCGCCGGCGCGCAGAGGAGCAGCCGGACTACGAGGTGGTG 
ATCGTCTTCGACCCGAAAGGCGACGCTGACCTGCTGAAGCGTATGTACGTGGAATGCGAACGTGCCGGC 
CGCCTGGACGAGTTCTACGTGTTCCACCTCGGTCATCCTGACCTGTCGGCACGCTACAACGCCGTCGGC 
CGGTTCGGTCGGATCTCCGAGGTCGCCACCCGCGTCGCCGGCCAGCTCTCCGGCGAGGGCAACAGCGCG 
GCGTTCCGCGAGTTCGCCTGGCGGTTCGTCAACATCATCGCCCGCGCGCTGCACGCGCTGGGTATCCGG 
CCTGACTACCAGCAGATCCTCCGGCACGTCGTGAACATCGATGCGTTGTTCGTCGAATATGCGCAGAAA 
TACATCAGCGAGCACGATCCCAGGGCCTGGGACACCATCATCCAGATCGAGGGCAAGCTCAACGACAAG 
AACT^TCCCGTTCAACATGAAAGGACGGCCCCTGCGGGTCGTAGCCATCGACCAGTACCTGACACAGAA^ 
CGCATCGCCGACCCGGTCATGGAAGGCTTGAAGAGCGCCGTGCGCTACGACT^GACCTACTTCGAC^G 
ATCGTGGCCTCGCTGCTGCCGCTACTGGAGAAACTCACTACCGGGCGGATCTCGGAGCTTCTTTCGCCC 
AACTACGCGGACCTCAACGATCCGCGGCCGATCTTCGACTGGATGCAGGTCATCCGCAAACGCGCCGTG 
GTCTACGTCGGCCTCGACGCACTATCGGATACCGAGGTCGCCGCCGCGGTGGGCAACTCCATGTTCAGC 
GACCTGGTCTCGGTAGCGGGTCACATCTACAAGCATGGTGTCGATGACGGCCTGCCCGGCTCGCTCGCC 
AGCGGCAAGGTCCGCATCAACCTGCATGCCGACGAGTTCAACGAGCTGATTGGCGACGAGTTCATCCCC 
ATGGTCAACAAAGCGGGCGGCGCCGGCGTGCAGGTGACGGCCTACACCCAGACCATGAGCGACATCGAG 
GCCAAGATCGGCTCCCGCGCGAAGGCCGGTCAGATCATCGGCAACTTCAACAACCTGTTCATGCTGCGG 
GTGCGCGAGACCGCCACGGCCGAACTCCTTACCAATCAGCTCCCCAAGGTCCAGATCTACACCAGCACG 
CCGGCGAGCGGCGCCAACGACGCGATCAACAACAACAAGAAGGTAGCCTTCACCTCCAGCTCGCACGAC 
CAGGTGCAGATGACCAGCGTGCCGATGCTCGAGCCGGCCCACATCATTGGTCTGCCCAAAGGACAAGCG 
TTCGCGCTACTCGAGGGCGGCAATCTCTGGAAGATCCGAATGCCGCTGCCGGCGGTCGCCCCCGACGAG 
GTGATGCCGAAAAGCCTGCAGGAGCTGGCTGCCGGTATGCGCAAGGGCCAGGCCGCCAACAGCGAGTGG 
TGGGAGGCGCCGGGATACTCCGCCCTGCAGGATGGTCTGCCCCAGGACCTGGTCGACGATTTCCGTCAC 
CTCGGCACCGGTGAGGATGCCGCCTGA 

Protein sequence: (SEQ XD NO: 149) 

MAGQYPLEALLRPAVELYTTTVCFTAAALCIVAPWTFSLTPLFGIVAALCFAWLGIVRLKQAGVVLRYR 

RNIRRLPKYTMTSAEMPVSNEHLFIGKGFRWTQKHTQRLADTYLPQFASYVEPSPLYERARRLEKQLEF 

APFPLKLVAKATAWDVAWNPARPLPPVGGLPRLHGIEPREQDVGLQLGERVGHTLVLGTTRVGKTRLAE 

LFITQDIRRTHCRVRRRRVKMGRRTQTVHHGYRRRRAEEQPDYEWIVFDPKGDADLLKRMYVECERAG 

RLDEFYVFHLGHPDLSARYNAVGRFGRISEVATRVAGQLSGEGNSAAFREFAWRFVNIIARALHALGIR 

PDYQQILRHVVNIDALFVEYAQKYISEHDPRAWDTIIQIEGKLNDKNIPFNMKGRPLRVVAIDQYLTQK 

RIADPVMEGLKSAWYDKTYFDKIVASLLPLLEKLTTGRISELLSPNYADLNDPRPIFDWMQVIRKJ^ 

VYVGLDAIiSDTEVAAAVGNSMFSDLVSVAGHIYKHGVDDGLPGSLASGKVRINLHADEFNELIGDEFIP 

MVNKAGGAGVQWAYTQTMSDIEAKIGSRAKAGQIIGNFNNLFMLRVRETATAELLTNQIiPKVQIYTST 

PASGANDAINNNKKVAFTSSSHDQVQMTSVPMLEPAHIIGLPKGQAFALLEGGNLWKIRMPLPAVAPDE 

VMPKSLQELAAGMRKGQAANSEWWEAPGYSALQDGLPQDLVDDFRHLGTGEDAA 
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FIG. 27 

RL048 

DNA sequence: (SEQ ID NO: 27) 

ATGACTACTCATCTGATCACCCTAGTCATCAAGCAGCCGAGCGACGCTCAGGCACGCCAACTC 
ATGTACCAGGAGTTGCTCGGACTGATCTCACGCTACGGCGGTGAGGTGACGTCCAAGGCCTTG 
GAGGACGAGTCGACCCTCTGCGAGCTGCTGGTGCAGATGCTGCCTGATCATGAGGTAGAGCAA 
GCCAGGAAACAGGTGCTCGAACTTCATGCCAAGGGCCGCCTGCAGGCGCCGGCAAGCCTGAAG 
GTGTAA 

Protein sequence: (SEQ ID NO: 150) 

MTTHLITLVIKQPSDAQARQLMYQELLGLISRYGGEVTSKALEDESTLCELLVQMLPDHEVEQ 
ARKQVLELHAKGRLQAPASLKV 
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FIG. 28 

RL049 

DNA sequence: (SEQ ID NO: 28) 

ATGAAGAAGTTCCTTGCCACGCTGGCATTTTGCACGGCGTTCGCGACTCAAGCCTGGGCCGCC 
GGGCTGATCGTTGTCGAAGACCTCGGCGGCGCCTCGGCGCTCCCCTACTACCAGGGCCTGGAT 
CCGCAGCCATCCGCTTCCGCACCAGGACCTGGCGACCTGGGCGTCCGTGGCTCAGGTGCGTTT 
CCAGTTCGCTCCGCCCGCCTATCGCCAGGACGGGTCCAGGGGCGCGCCATCAACGCTCCAGGC 
CTGCAACTGCTGTTCCTGGTCGGCGACGACACGCTGTCTCGAACCTGGCTGAAAGAGCGAGGC 
GACGAGCTTCGAGACCTCCAAGCCGTGGGCCTGGCAGTGAACGTGGCCAGCGAAGCGCGCCTG 
ACGGAAATCCGGGCCTGGGGGAAAGGACTTCAGATATTGCCGGCGCCGGCGGACGACCTGGTC 
GACCGGCTAGGGCTGCAGCATTACCCCGCCCTCATCACATCCACCGCCATCCAGCAGTAG 

Protein sequence: (SEQ ID NO: 151) 

MKKFLATLAFCTAFATQAWAAGLIWEDLGGASALPYYQGLDPQPSASAPGPGDLGVRGSGAF 
PVRSARLSPGRVQGRAINAPGLQLLFLVGDDTLSRTWLKERGDELRDLQAVGLAVNVASEARL 
TE I RAWGKGLQ I L PAPADDL VDRLGLQHY PAL I TS TAI QQ . 
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FIG. 29 

RL050 

DNA sequence: (SEQ ID NO: 2 9) 

ATGGCAACGTCTGTAGTTCGAGCCCTCCAGTTGGCCACCCTGCTGGTCCTGGTCAACATCGCT 
CAGGCCGCCGTGGATCCACCGCCGGCGTACAAGCAAATCGCCCTGCCCAAAGGGGTTCCGGCC 
GAGGTGCTCTACTCGGTCGCGCTGACCGAGAGCAAGGTCCTGCTGCGCGGCGAATACGTTCCC 
TGGCCCTGGACATTGAACGTCGCCGGGAAATCTTACTACTACGCGACCCGCACCGCCGCCTGC 
ACAGCGCTACTCGCGGCGATCAACCTCTACGGGGCCAAGAGCGTCGATTCCGGCCTCGGCCAG 
GTCAACATCGGCTGGAACGGACATCGTTTCTCCAGCCCCTGCGAGTCCCTGGATCCGTACAAG 
AACCTGGACGCCACCTCCGACATCCTGATCGAGCAGCGGGACGCCCTGTATGCATCCGCCCCG 
GGAAGACCGGTGGACTGGATCCAAGTTGCCGGCCGCTACCACCGCCCCGCCGGCGGCGCGCCT 
GCCGCCAAATACCGTAGGACGGTTTCCCGCCACCTTAGCCAAGTTCTCGGCGTCAACCTACTG 
GTGACCAATCCATGA 

Protein sequence: (SEQ ID NO: 152) 

MATSVVRALQLATLLVLVNIAQAAVDPPPAYKQIALPKGVPAEVLYSVALTESKVLLRGEYVP 
WPWTLNVAGKSYYYATRTAACTALLAAINLYGAKSVDSGLGQWIGWNGHRPSSPCESLDPYK 
NLDATSDILIEQRDALYASAPGRPVDWIQVAGRYHRPAGGAPT^KYRRTVSRHLSQVLGVNLL 
VTNP 
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RL051 

DNA sequence: (SEQ ID NO: 30) 

ATGATCAGAACCGTATCGCTCCTGTCCGGCCTGATGCTGCTGCTGAGCTATCCCGCAGCCGGCCAGGAGGCGGCGGCAAGCCG 
AGAGGCCAGCAGCCAACTGTCCGGTAGCCAACTCGGCACGCTGAAACAACAGACATCTCAGAGCGACCTGGCCCAGGAGTGGG 
GACTGAACCAACAGGAATGGACCCGCTACCAGACGCTCATGCAAGGCCCCCGGGGCGCTTACTCGCCTGGTATTGATCCGCTG 
ACCGCGCTGGGCATCGAGGCGCGATCGGCAGAGGAACGGCGGCGGTATGCCGATCTACAGGTCCAGGCCGAACGGCGCCGGGT 
CGAGAAGGAACTCGCCTACCAGCGCGCATACGACGAAGCCTTCGCCCGCGCCTATCCAGGCGAGGGGGTGATCCGCCTCACCG 
AAAGCAGCACAGCCAACCCGTCGGGCACGCCGAACATGAGCCCAGCGTTGCAGAGCAGCGGGCGCCTGGCCCTGTTCGTCCAG 
GACAACTGCACCGCCTGCATCCAGCGGGTCCGCGACCTGCAACATGCAGAAAAGGAGTTCGACCTCTACTTCGTCGGTAGCCA 
GAACGACGCAGAGCGAGTGCGGCGCTGGGCAATCCTCGCCGGCATCGACCCGAAGAAGGTTCGCAGCAAGCAGATCACGCTCA 
ATCATGACGAGGGCCGCTGGATGGCCCTAGGACTGGGCGGAGCCCTTCCCGCCCTGGTACAGGAGGTGAACGGCCGATGGCAA 
CGTCTGTAG 

Protein sequence: (SEQ ID NO: 153) 

MIRTVSLLSGLMLLLSYPAAGQEAAASREASSQLSGSQLGTLKQQTSQSDLAQEWGLNQQEWTRYQTLMQGPRGAYSPGIDPL 

TALGIEARSAEERRRYADLQVQAERRRVEKELAYQRAYDEAFARAYPGEGVIRLTESSTANPSGTPNMSPALQSSGRIJUiFVQ 

DNCTACIQRVRDLQHAEKEFDLYFVGSQNDAERVRRWAILAGIDPKKVRSKQITLNHDEGRWMALGLGGALPALV^ 

RL. 

Ri:i052 

DNA sequence: (SEQ ID NO: 31) 

ATGAAACGCCCATCCCCTGCATCAATGATTCTTGGCCTCTGTTTGACGGCAATGGCCGGCCTGCTGAGCTACCAGCAGTACCA 
ACTCGTTCAGCTCCGATCAGGCGTGGACAGTGCCGCGGAAAAGGCCTCGCTGGAGGCGATCCTGGCTCGCTTGAGTCGAGTCG 
ACGAGCGCCTCGACGCCGTGGATGGACAGCACCTGGTCAGCAACGAGGACTTCCGTTCAGGCCAGCAGGCGCTGTCCAACCGA 
ATTGACGCTGCGCAGGCGTTCGCCAAGCAGGCCTCCGATGCCGTCGAGAACCTGGCTCAGACCACCGCCTCGGCCGGCGACCT 
CTTGGTGCTCAAGGCAACCGTGGAGACACTGGACGGTTCTGTCCGCACGCTTCAAGAAAAGCAGGCCAAGGCGCCGCCGCTGA 
TCGTGCCAGCGCCAAAACGCCCCATACCCGCCAAGCCCAAGCCGAAACCCAAACCGATGGAGCCCCCGCCCTTCTCGATCCTT 
GGCGTGGAGTATCGCGGGGGAGAACGGTTTCTGTCGGTTGCACCTCCGGGATCCACCCAGCTCAGCCAGATCTACCTCATTCG 
CCGGGGAGATGCCGTCGCCGGCACGACCTGGCGACTGACCGACCTTGACGATGGTACCGCGCACTTCGACGTCGCCGGCACCT 
CGCGCAGCGTTCGCATCCAACCATAG 

Protein sequence: (SEQ ID NO: 154) 

MKRPSPASMILGLCLTAMAGLLSYQQYQLVQLRSGVDSAAEKASIiEAILARLSRVDERLDAVDGQHIiVSNEDFRSGQQ^ 
IDAAQAFAKQASDAVENLAQTTAS AGDLLVX.KATVETLDGS VRTIiQEKQAKAPPLI VPAPKRP I PAKPKPKPKPMEPPPFS I L 
GVEYRGGERFLSVAPPGSTQLSQIYLIRRGDAVAGTTWRLTDLDDGTAHFDVAGTSRSVRIQP. 

RL053 

DNA sequence: (SEQ ID NO: 32) 

ATGCCGCGCCGCTTGATCCTCTCGGTCACGGAGCGGGATATCCTATTTGCACTGCCGGTAAGCCGAATGACCTCACTGACTAC 
TCCACCTCAACGAGTCCGCCCCATCGTCGATCCGCCAGCGACGCGGCGATGCCAATCACTTGGTTTTTTCGGTGCAGGTCAGC 
CTGCTGTGCTATCCAGCGTTCAGCCCTGATGCGCGACGAAGAGCCCCCGAG 

RL054 

DNA sequence: (SEQ ID NO: 33) 

ATGGCCGAAGCTATCAGAAAGGATGCAATGATGACAAAACTCTACTTTGATCTTCTGAACTCGCCTGCCGAGGCTCATTCGTC 
GATACAAAAGTCTTTATCTGTGCAGGCAATCTCCACAACTGTCCCAATACTGGAGTTTCCTTCGGAAACCGTATACGCCTATG 
CATCGTACATAAATGCATTAAGTATCGGTCAACGCATAGATCCTGCATTCACCCAGAGCTTAACGAGTGCCATATCCAACCTG 
GCAGGTCGCCCGATTGCAGTAAGCGACATTTACCAAAAAATTCATGAAACCACACTGAGAACACCTGTTGAGATGGGCGTTCG 
TCCTAATAGCATCACCTTTGAGGAGTATCA(3GCCACCATAAATCAGCAAGCCATCAACATGGTTCAAGATATGCAGGATGGAG 
ACAAAGGTGAGAAGGTGGAGGCCCTCCAGGCCAATATGCAGTTCCTGTATGGACAGGAGATAAATACTGATTTCATCGCTCGT 
AATGAACTCGCTGCTGGGCAGAGAGCGAAAACCGTCGCAATAGTTCAGGGGCATATCACCATCGGGTACGGCTTCGATACCTT 
CGTGCATGAAGCGTCCGAGCTAAACTCTTTGAATCTTGTTGGTTCTACGCGACAGAAGGTATTACCTGCATTGCAGCTATCAA 
CGTCCGACCCAGGCTTCTGGAGCGTCTATGCCTTGCTGGGACAAAGTCTCACGGATGACGATGGGCTATTACTCTTTAGTGCC 
AAAGCGCGAGCTGTTGTTCAACGCATAGCAAGCAACCAGTTTGCAGGTAAGTGGAATGGGCTACCCCCAGCTATCAAAACGGT 
TGCGCTTGATCTATATTATCAATATGGGCAGACTGGTAATTTTCCAAAATTTCAACAAGCTATAAATAGCCATGATTGGCCGG 
CAGTCATCCATGAACTTAGAAACTGGAATGGTGTACCGAATGATCCTCTCCAGTTCATTACAA7UVCGATTGGAAGAGCGAGCC 
AAGTATCTGGCAATATCCTTCAACTATGAGCAATGA 
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Protein seq[uence: (SBQ ID NO: 155} 

MAEAIRKDAMMTKLYFDLLNSPAEAHSSIQKSLSVQAISTTVPILEFPSETVYAYASYINALSIGQRIDPAFTQSLTSAISNL 
AGRPIAVSDIYQKIHETTLRTPVEMGVRPNSITFEEYQATINQQAINMVQDMQDGDKGEKVEALQANMQFLYGQEINTDFIAR 
NEIJ\AGQRAKTVAIVQGHITIGYGFDTFVHEASELNSLNLVGSTRQKVLPALQLSTSDPGFWSVYALLGQSLTDDDGLLLFSA 
KARAWQRIASNQFAGKWNGLPPAIKTVALDLYYQYGQTGNFPKFQQAINSHDWPAVIHELRNWNGVPNDPLQFITKRLEERA 
KYLAISFNYEQ. 

RL0 55 

DNA sequence: (SEQ ID NO: 34) 

ATGAAC AACAC AGTGAGCGAAACGCAACAGATC AATATTT AC C AAAAT CCGGGGC AGTCT ATTT C CGGTCT CT ACAAGGGGCT 

GGCTAACCAGTGCTCTCCTGGCCAGCCATTTCCAGAGGTACAGCTTGTGGAGGCTTGGGATATCCCTCTCGTACTCCATCCGG 

AGTTTGTGCCTAACGGAGATGTCTCGAAAATCGATAAGGAGTACGGAACGATCCTTGCTGCTGAGTCAGCTCAGGTTATCCTG 

CTTCAACTCCAAATGGCTCJVAGACAAGGCTAAGGCGTGCGGGGAGGTTACAGCCTTGATCAGTTCTGTCTCCTCCy^ 

TACCATTAAGAGTCGTCATGGTGCTAATTATCTAAACCTGCTGAAACAATCACCGAACCGATACCCGACTAGCGTCGGAGTTG 

AGATCATGTCAGGTGGCAGTCCGAACCAGGATTCTGGAATCGAGGTCTCTTACGGTGCCAGTCTCGGCCGTCTAACTCAATCA 

CT^CTTCAGGCGATGAATCTGCCTGCCAGTCTCAAACAGTTGCTCACTCAGGGAATCGGTGTGAAGCTTTCTCAGCCTGAATA 

TTGGCCTGCTTACAACAACATAGCCACTGGTATTCGTTATACAACCGGAGTGGCGATAACGTTGGCCTATTGGGCCACGGTTT 

AG 

Protein sequence: (SEQ ID NO: 156) 

MNNTVSETQQINIYQNPGQSISGLYKGIJ^QCSPGQPFPEVQLVEAWDIPLVLHPEFVPNGDVSKIDKEYGTILAAES^^ 
LQLQMAQDKAKACGEVTAL I S S VS SNLNT I KSRHGANYLNLLKQS PNRYPTS VGVE IMSGGS PNQDSGI EVS YGAS LGRLTQS 
QLQAMNLPASLKQLLTQGIGVKLSQPEYWPAYNNIATGIRYTTGVAITLAYWATV 

RL056 

DNA sequence: (SEQ ID NO: 35) 

ATGACCCAAGCTGCGAAAATACCAGCAAATGAGTACTCATTGGGGGATGGAAGAGGCTACATCAATATCTGGCCGGAAAAGGA 
TGAGGCTCAGGCATTTCTTATCCATAATGATGGGCCTAATGGGGCTACATGCAGCCTTAAAGGCACTCTTAGAGATAATAAAG 
GAGTGGTGCATTCGCCGTATTCCTCTGCTTCATGTTTGCTAAGTATCACCCAGACAGGGCTGCTGTCAGTAAGCGTCAAACGT 
GAGGAAAATTCGCCAAGCTGCTCTGCATGGTGCGGTCCTAGAGTTTGGTTTGAAGGAGCCTATAGCGTCCCGCCCAAGGGCTG 
CTACTATATGCAAATAAGGAAAAAAACTCGACAAATGTTGGGTATGATTGAGAAAAAAGAGCTTGATGCCGCTCGCGCCTTAT 
CAAATAAGCTTTTGTCAGACTGCGCAACCGAGCTAGCCTATCCTGCCAAGATATACTTGACGAACACACTTGCCATGATCAGT 
GCTGAAAAGGGAGAGAATGCTCGCTGTTTGGAGTATGCCCATCGGGTGCAAAAGCAAATTCCTGTAAGAGATGACGGCCAACC 
GGCTGAAGACTTGCTCCCGGCGGAGCACGCTTTCGCTATGGAACAACGCGCCAAGGCTGATGCTCTGTCTGAGCGATGCAGCG 
ACGAGAAATAA 

Protein sequence: (SEQ ID NO: 157) 

MTQAAKIPANEYSLGDGRGYINIWPEKDEAQAFLIHNDGPNGATCSLKGTLRDNKGWHSPYSSASCLLSITQTGLLSVSVKR 
EENSPSCSAWCGPRWFEGAYSVPPKGCYYMQIRKKTRQMLGMIEKKELDAARALSNKLLSDCATEI^YPAKIYLTNTLAMIS 
AEKGENARCLEYAHRVQKQIPVRDDGQPAEDLLPAEHAF7\MEQRAKADALSERCSDEK 

RL057 

DNA sequence: (SEQ ID NO: 36) 

GTGCTGGTAGAGCGTTTGCCGACTGATGTTGAATTCGCGGGCGAGCTGAGCCTTGGGCTCGCCGGCCGCTGCCCGCAGCCCCA 
GGGTAGCACCTGCTTGTCGGACAAGGCCTCTTTGCGGCCCCGGTACGCGCAGAGCTTGATATCCTCGCGCTACCGCGCTGGTG 
CTGCTTGTATGCTGCTCAGCAAGCCAGCCGCCGGTCTTTTCAGC3GTAAGCGTACGGCCAATACACCTTTACTTAGGTTC3A 

Protein sequence: (SEQ ID NO: 158) 

VLVERLPTDVEFAGELSLGLAGRCPQPQGSTCLSDKASLRPRYAQSLISSRYRAGAACMIiLSKPAAGLFRVSVRPIHLYIiG 
RIi058 

DNA sequence: (SEQ ID NO: 37) 

AT(3GATATTCGCCTGGAGATTTTAGCGCTTGAACAGCTGTTGCTAGAGCCGGAATCGAGAAAGAATGATCGACTGCTTAAACA 
GCTGCTTACCGAAGACTTCGTTGAATTTGGAGCTATCGGCAAAAGCTGGACGAAAGCGGAGGTGATCGTGGGACTAAAATCCC 
AGACTTGGATCAAAAGGACAATCGAGGATTTCAAACTGCGTGTGCTTGCAGATGGTGTCGCGTTAGCAACGTACCGATGCCGT 
CATCAAAATGCTAATGGCGATGAGTCGTTATCAATGCGTAGCTCTGTTTGGAAAACCTACC3AAC3ATGGTTGGCACATGGTGTT 
TCACCAAGGCACGAGGGTCTCCGAGTAG 
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Protein sequence: (SEQ ID NOx 159) 

MDI RLE I LALEQLLLE PESRKNDRLLKQLLTEDFVEFGAIGKSWTKAEVI VGLKSQTWI KRT I EDFKLRVLADGVALATYRCR 
HQNANGDESLSMRSSVWKTYEDGWHMVFHQGTRVSE . 

RL059 

DNA sequence: (SEQ ID NO: 3 8) 

ATGACTTCCTCGCCCAACCTTGACCAGATGACCCCGGAACAGCTTCGTGCCTTGGCGGCACAGGCGTTGCAGTTGCAATCCCA 

GGTCGAGGCGATGAGCAGGAAAATCCGCAACAATGAAACCCTCATCGAACAGTTCAAGTTCGAAATCGCTCTGCTCAAACGCC 

ACAAGTTTGCCAAGCGCAGCGAGCAAATCAGTTCGGCGCAAGGCAGCTTGCTGGATGACCTGCTCGACACCGACCTTGAAGCT 

ATCGAGGCCGAGCTGAAACAACTCCTTCCAGCTTCGCCACAAGCCGAGCCACGGCAATCCCCGAAACGTTCGCCATTGCCGCC 

GCAGTTCCCGCGCACGGTGATTCGCCACGAACCTGAAAATACCCAATGCGCCTGCGGCTGCCAACTTCAACGCATCGGCGAAG 

ACGTCAGCGAGAAGCTGGATTACACGCCGGGCGTGTTTACCGTCGAGCAACATGTGAGGGGCAAATGGGCCTGCCGTCAGTGC 

GAAACCCTGATCCAGGCGCCGGTGCCAGCCCAGGTTATTGATAAAGGCATCCCGACCGCAGGTTTGTTGGCCCACGTGATGGT 

GGCCAAGTTTGCCGATCACTTGCCGCTGTACAGACAGGAAAAAATCTTTGGCCGCGCCGGGCTGCCAATTGCCCGCTCGACCC 

TGGCGCAGTGGGTCGGACAAACTGGCGTGCGGCTTCAGCCACTGGTCGATGCACTGCGTGAAGCCGTGCTGAACCAGGACGTG 

ATCCACGCCGATGAAACACCGGTGCAAATGCTTGCACCAGGCGAGAAGAAAACCCACCGGGTCTATGTCTGGGCCTACAGCAC 

GACGCCGTTTTCGGCGCTCAAAGCGGTGGTTTACGACTTCAGCCCAAGCCGTGCCGGAGAACATGCACGCT^CTTCCTAG^ 

ACTGGAATGGCAAGCTGGTCTGCGACGACTTCGCTGGATACAAGGCCGGTTTTGAACAAGGCATCACTGAAATCGGCTGCATG 

GCTCATGCTCGCCGCAAGTTCTTCGACCTGCATGTCGCTAACAAAAGCCAACTGGCCGAACAGGCGCTGCACTCAATTGGCGG 

TTTGTACGAGGTTGAACGCCAGGCTCGGGACATGAGCAACGAAGACCGTTGGCGAATACGTCAGGAAATGGCGGTACCGATCA 

GCAAAACACTGCATGACTGGATGTTGGCCCAGCGCGACCTGGTGCCCAACGGCTCGGCCACAGCTAAAGCCCTCGACTACAGC 

CTGAAACGCTGGGGAGCGCTGACGCGCTACCTGGACGATGGGGCTGTGCCCATCGACAACAATCAGGTGGAGAACCAGATACG 

GCCGTGGGCGCTCGGACGCTCGAACTGGTTATTTGCCGGATCGCTGCGCAGTGGCAAACGAGCAGCAGCTATCATGAGCCTGA 

TCCAGTCCGCTCGCATGAACGGGCATGATCCGTATGCCTACCTGAAGGACGTGCTAACTCGCCTGCCGACGTTACGGTCGAAA 

GACATCAGCCAGTTGCTGCCGCATCAGTGGGTACAGATCTAG 

Protein sequence: (SEQ ID NO: 160) 

MTSSPNLDQMTPEQLRALAAQALQLQSQVEAMSRKIRNNETLIEQFKFEIALLKRHKFAKRSEQISSAQGSLLDDLLDTDLEA 
lEAELKQLLPASPQAEPRQSPKRSPLPPQFPRTVIRHEPENTQCACGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGKWACRQC 
ETLIQAPVPAQVIDKGIPTAGLLAHVMVAKFADHLPLYRQEKIFGRAGLPIARSTLAQWVGQTGVRLQPLVDALREAVLNQDV 
XHADETPVQMLAPGEKKTHRVYWAYSTTPFSALKAVWDFSPSRAGEHARNFLGDWNGKLVCDDFAGYKAGFEQGITEIGCM 
AHARRKFFDLHVANKSQLAEQALHSIGGLYEVERQARDMSNEDRWRIRQEMAVPISKTLHDWMIiAQRDLVPNGSATAKALDYS 
LKRWGALTRYLDDGAVPIDNNQVENQIRPWALGRSNWLFAGSLRSGKRAAAIMSLIQSARMNGHDPYAYLKDVLTRLPTLRSK 
DISQLLPHQWVQI . 

RL060 

DNA sequence: (SEQ ID NO: 39) 

ATGATCCGCATCGATGCGATCTGGCTAGCCACCGAACCGATGGACATGCGCGCCGGCACCGAGACGGCATTAGCCCGGGTAAT 
TGCGGTGTTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAATCGCCGGGCTAACCGAATGAAAGTGCTGGTGCACG 
ATGGCGTGGGCATCTGGCTTGCCGCGCGTCGACTGAACCAAGGCAAGTTCCACTGGCCCGGCATTCGCCATGGCTGCGAGGTC 
GAACTCGACAGCGAACAACTCCAGGCCTTGGTGCTGGGCCTGCCGTGGCAGCGCGTCGGCACAGGCGGTGTGATCAGCATGCT 
GTAA 

Protein sequence: (SEQ ID NO: 161) 

MIRIDAIWLATEPriDMRAGTETALARVIAVFGAAKPHCAYLFANRRANRMKVLVHDGVGIWLAARRLNQGKFHWPGIRHGCE 
ELDSEQLQALVLGLPWQRVGTGGVISML . 

RL061 

DNA sequences (SEQ ID NO: 40) 

ATGCGCCAACGAAGCTCTTACCCGAAACCGTTCAAAGCCCAGGTCGTTCAGGAATGCCTGCAACCT<3GGGCAACGGTGTCCAG 
TGTCGCCATCAGCCACGGCATCAACGCCAATGTCATCCGCAAATGGCTGACGCTTTATCGAGACCAGCCCGTACCAGCCTCGT 
TACCAGCCTTTGTCCCGCTGAAGGCCACCCCTAAACGGCCAGCCGAAACGTCAGTGCTCATTGAACTGCCCATGGCCGGGCAA 
ATGATCACGGTGAAATAG 

Protein sequence: (SEQ ID NO: 162) 

MRQRSSYPKPFKAQWQECLQPGATVSSVAISHGINANVIRKWLTLYRDQPVPMLPAFVPLKATPKRPAETSVLIELPMAGQ 
MITVK. 
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RL062 

DNA sequence: (SEQ ID NO: 41) 

ATGGCTTTATCTCTTATTCGTAGTCTCACTGCGTCCGCCTCACGAAACATCTCGGCGTTGAAACGCGATGCCAAACGCTTGCA 

GAAGAACTCCTTTCTTGTGTTTGGAACAGAATATCCACTCAAGGTTTGCCAAAATGCGGTAGCAGTTTCTCGCGGCTTCCGCT 

CACTCGCTGATGTCGATAAACTGGAGCAGCACATTGGCATGAATAGAAGCGCTCCATTCTGGGTGATCCGTGGCCGCAACGAT 

ACACACCAGGGGGTACTGGAAGCGCTATATTGTTTAGACCTTGAATATACCGAGAATGGCCCCGTCGTTTTTACTGGAAACCC 

AAAGCACTCTATACTTCCAGCCTTAGTCCTTTTTCTTGAGCAAATGAGCTTTAAGAAACTACCCGGACTAATCCTCATCGAAA 

CAAAAGAGACCTCAATCCAAACAACCCATATATTCGACGCAATAGAAAAATTAGAAGTCGAAGAAACTCTAAATAAATTTCGA 

TTTCTTGACTTGCGAGACCGAAACCTTCCCGTTTCGCTTAGTACCGAGGCTCGTTGCTGGATCGAGTCAATTGTCAGTTTATT 

GCCAAACGACATCCAAGAGGAAATACGTAATAAAGGATGGTCAACTCACTTAGAGATCAGTGCATATGAGCATGCAAAGTCTC 

GTAATCAAGTATTTGGCTCCTCCAACTTCCCTTGCGTCCCCTTCCTCTCCATAAAGTCAGCGATCTATCAACTCATTTCAGGC 

GCATACCCTCCCTTATGGATGCAGCCATCCTCCTCTGGCGAAATATCTAAAGTTGATATACGCCGACCTCCTCTCGAAAAAAG 

CTCAGAGGAAACCTTACTTTATCTCATAAAAAAATTAGAGAATCGACAGTTCCACACAGGCATTTCATGTGAGC^ 

GATGGCGGCCGTATGTCGTACTCTTCTCCAGGAATGATCCGGCTAGCGAGGTACTAGCAGGAGTTATACACTCGTACTTTTCT 

TGGAAGCAAGATAGAGACCATCGCTCACCCACCCTTTATGTTTCAGATGGAGCAGTTCCCTATGCTCCCAAGCTTCTAGGTTT 

AGGCGGCCATACGGTCATTGCAAATGGAATCACTGAAATTCCCGACGGGGATGGTCTTGGGGAGTTCTATGGCTACAAGAACT 

CACTTAAAGTCAGCTCCTTATCTAACGGAATACAGTTCATGGGTAAGCATGTATCACTAAAGTAA 

Protein sequence: (SEQ ID NO: 163) 

MALSLIRSLTASASRNISALKRDAKRLQKNSFLVFGTEYPLKVCQNAVAVSRGFRSIJU^VDKLEQHIGM^ 

THQGVLEALYCLDLEYTENGPWFTGNPKHSILPALVLFLEQMSFKKLPGLILIETKETSIQTTHIFDAIEKLEVEETLNKFR 

FLDLRDRNLPVSLSTEARCWIESIVSLLPNDIQEEIRNKGWSTHLEISAYEHAKSRNQVFGSSNFPCVPFLSIKSAIYQLISG 

AYPPLWMQPSSSGEISKVDIRRPPLEKSSEETLLYLIKKLENRQFHTGISCEHESRWRPYWLFSRNDPASEVLAGVIHSYFS 

WKQDRDHRSPTLYVSDGAVPYAPKLLGLGGHTVIANGITEIPDGDGLGEFYGYKNSLKVSSLSNGIQFMGKHVSLK. 

RL063 

DNA sequence: (SEQ ID NO: 42) 

ATGAACGCTCTGACCCAACCGGCCGCCCTCGCCGCCTCCCACCTGAACATCAACCTGACCGACTTCATCGACGAGTTCGGCGA 
CGAGCTCCTGGAGTCGCTCAATCGCTCCAACCCCCCGGTCTATACCGGCTCCGTCAACGCTCACCGCCAGTTGGTGATGGACC 
GACTCAAGCGCAAGCCCTTCGCGGCCCAGGCCGAGGTCGTCCAGGCCATCACCGCCCTGCTGCTGGACCGTAACGAGGAGGCC 
GGAATCATCAACGCCGAGATGGGCACCGGGAAAACCATGATGGCCATCGCTGTCGCAGCGGTCATGCACC3CCGCCGGCTATCG 
CCGGACCCTGGTCGTCTCTCCGCCGCACCTGGTCTACAAGTGGCGCCGCGAGATCCTGGAGACCATCCCAGCCGCCCGCGTCT 
GGGTACTCAATGGCCCAGATACTCTACTCAAGCTGCTCAAGCTGCGAGATCAGATGGGCGACGCCTACGACGGGCGCCAGGAG 
TTCTTCATCCTCGGCCGCGTGCGGATGCGGATGGGTTTCCACTGGCGGCTCGCCTGCTGGAAGAAACGCGCCGCCGGCGGCCA 
ACTGCTCGCTGCGTGCCCGGATTGCGGACAGGTCCTCGAGGACCTGGAAGGCAACCTGGTCACGGTGGAGGAGTTCGAGCGTG 
GTGACCGTCGACGTACCTGTTCCTCCTGCCGTGGGGCGCTCTGGACGCTGATCCGACCAGGCAAGCCCGACGGCGGCAACCGG 
CGCGCAACGATTCTCAAGTCGATGTGCCGGATACCAACCATCGGCCCGGTCAGGGCGGAGCGCCTGCTGAACGACTTCGGCGA 
GGACTTCCTGGCCACGATGTTGGTGGACAACGTCTCGGAGTTCATCAACCTGATGGACGCCAAGGGCAACTTCGTCTTCAGCG 
ATCGGCAGGCCAAACGCATGGAGCGATCGATGGCAAACATCGAGTTCGGCTTCGGTGAAGGCGGCTACCAACCGACCGAGTTC 
ATCAAGCGCTACCTACCTGATGGCTACTTCGACCTGCTGGTGCTGGACGAGGGACATGAGTACAAGAACAGCGGCTCGGCCCA 
GGGCCAGGCCATGGGCGTTCTCGCAGCCAAGGCACGGAAAACCGTGCTGCTGACCGGAACGCTCATGGGCGGCTACGCCGACG 
ATCTGTTCTATCTCCTGTTCCGCATCCTCACCCAGCGCATGATCGAGGACGGCTATCGGCCCAACGCGCGCGGCAGCATGGCT 
CCCGCAGCCATGTCGTTCATGCGCGACCACGGTGTGCTCAAGGATATCTACACCGAGCGCGACGGTGATTCGCACAAGACAGC 
GCGG(3GCAAGAAGCTCTCGGTACGCACGGTGAAGGCTCCCGGCTTCGGCCCAAAGGGCATCCACCGCTTCGTATTGCCGTTCA 
CCGTGTTCCTGAAGCTCAAGGATATTGGTGGCAACGTACTGCCCGACTACCAGGAGGAGTTCATCGACGTGCCCATGGCGCCT 
GAGCAGGCTTCGGCCTATCAGCGCCTGGC(3GCCACGCTGACAGCGGAGCTCCGCCAGGGTCTGGCGCGACGAGATACCACGCT 
CCTGGraCGTGGTCCTCAACGTGCTGCTGGCTTGGCCGGACTGCTGTTTCCGACCGGAGATCGTCAAGCATCCGCGAACCCGGG 
ACACACTGGCCTTCGTGCCAGCGATCTTCGGTGACGAGCAGTTGATACCCAAGGAGCAGGTGCTGGTGGACCTCTGCTTCGAG 
GAGAAAGCGAAGGGCCGCAAGGTTCTGGCATACACCGTCTACAGCGGGACGCGCGACACCACGTCCAGGCTGAAGAAAGTGCT 
CGAGCAATCCGGGCTGAAGGTGGCAGTGCTACGTGCTTCGGTCGATACCGCTCGACGCGAGGATTGGATCCTCGACCAGGTCG 
ATCGCGGCATCGATGTGCTGATCACCAACCCGGAGCTGGTGAAGACCGGGCTGGACTTGCTCGACTTCCCGACCATCGCGTTC 
CTGCAAACGGGCTACAACGTGTACACCCTGCAGCAGGCCGCGCGGCGGTCGTGGCGGATCGGGCAGAAGCACCCGGTGAGGGT 
GGTGTTCTTCGGCTACGCCGGCAGCTCGCAGATCACCTGCTTACAGCTGATGGCCAAGAAGATCGCTGTGGCTCAGAGCACGT 
CGGGAGACGTTCCCGAGTCAGGTCTCGACTCGTTGAACCAGGATGGGGATTCTGTGGAGATGGCGTTGGCACGACAACTCATC 
GCAGCATGA 
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Protein sequence: (SEQ ID NOx 164) 

MNALTQPAALAASHLNINLTDFIDEFGDELLESIiNRSNPPVYTGSWAHRQLV>roRLKRKPFAAQAEWQAITALLLD 

GI INAEMGTGKTMMAI AVAAVMHAAGYRRTLWSPPHLVYKWRREILETI PAARVWVLNGPDTLLKLLKLRDQMGDAYDGRQE 

FFILGRVT^^RMGFHWRI^CWKKRAAGGQLLAACPDCGQVLEDLEGNLVWEEFERGDRRRTCSSCRGA^ 

RATILKSMCRIPTIGPVRAERLLNDFGEDFLATMLVDNVSEFINMDAKGNFVFSDRQAKRMERSMANIEFGFGEOT 

IKRYLPDGYFDLLVLDEGHEYKNSGSAQGQAMGVLAAKARKTVLLTGTLMGGYADDLFYLLFRILTQRMIE 

PAAMSFMRDHGVLKDIYTERDGDSHKTARGKKLSVRWKAPGFGPKGIHRFVLPFTVFLKLKDIGGNVLPDYQEEFIDVPM^ 

EQASAYQRLAATLTAELRQAIiARRDTTLLGVVLWLLAWPDCCFRPEIVKHPRTRDTLAFVPAIFGDEQLIPKE 

EKAKGRKVLAYTWSGTRDTTSRLKKVLEQSGLKVAVLRASVDTARREDWILDQVDRGIDVLITNPELVKTGLDLLDFPTIAF 

LQTGYNVYTLQQAARRSWRIGQKHPVRWFFGYAGSSQITCLQLMAKKIAVAQSTSGDVPESGLDSLNQDGDSVEMALARQLI 

AA 

RL064 

DNA sequences (SEQ ID NOs 43) 

ATGGCCCTCATGTTCCCGCGCTTGGCGCGCAACTTTGCACGCAACGGCTACTTCCCTACCGATGAGGTCACCCTCGAACGCGC 

TCTGCAGGCCCTCACTCTTGCCCCGTCGGGAAGGATGAGGATCTGTGACCCCTGCGCCGGTGAGGGTGTTGCCCTGGCTGAGG 

CAGCACACACCCTCGGCCGCGATCAGGTCCAAGCCCTCGCTGTCGAGTACGACCGCGAGCGCGCCGACCATGCCCGAGGATTG 

CTTGACCGAGTGCTGCACAGTGACCTTTTCGACACCATGATCAGCAGGCAGTCGTTCGGACTGCTCTGGCTCAACCCGCCTTA 

TGGCGACCTGGTGGCGGACCACTCCGGTGCGTCGCAGTACCAGGGCAGCGGCCGCCGGCGTCTGGAGAAAGCGTTCTACCAGC 

GCTGCCTGCCGTTGCTGCAGTACGGCGGCGTCATGGTTCTGATTGTTCCTCACTACGTCTTGGACGATGAGCTGACTGGCTGG 

TTGAGCAACCACTTCACCGGCCTGCGCATCTACGCAGCCGCGGATCCTACCTTCAAGCAGGTGGTGATCTTCGGCATCCGGGT 

CCGTCGGCAGGACCTGGCCCGGGCGGACGCCAATCAGGTGAGGTCTCGCCTGCAGGCGATCGGAGCGGGCCAGGAAAAGGCCG 

AGGAAATTCCAGCGGCTTGGCCGTGGGAACCCTACGTGGTTCTGCCGGCCACCAGCGAGCTGGAGCACTTCTACCGAGTAACC 

CTGGAGCCGGAGCAGTTCGCCGGTGAAATCCAGCGGCTGCGAGGTCTCTGGCCTGACTTCAACCTGCACTTCGCGCAAGCGGG 

GCTGCAGCCGCGCCCACCAGTCCGCGAGCTGTCTCGCTGGCACCTGGCCCTGGCCCTGGCCGCCGGCGCGATATCTGGCGTCG 

TGCGATCGAAGTCGGGCCGGATCCTGGTCGTGAAGGGTGACACCTACAAGGACAAGGTCCGCAAGACCGAATTCACCGAGGAC 

GACGACGGCAACATCACCGAGGTGAGGATCCTCACCGACCGTTTCATCCCGATCATCCGGGCATGGGAAATGACACCCTCCTC 

GGTCAATCAGGGCCGCGTGCTGACCATCAGCTCCTCGGCCGCGACCACGGAAGAAGCTGAAGAGCCCCAACCTGAGCCGGCCC' 

CCGCACCCGCACCGCTGCTGATCAGCCCTGGCCGGGTCGTAATGACCGCAGCCGTGAGCCACCTGGTGGAAACCGGTCAACTC 

AACCCAGCGCCTTTGCTGAAACGCCATCTGGCGGGAGATTGGGGAACGCTGGACCAGGAAGACTGGAACACCAACCAGAGAGC 

CCTGAAGTTCGGCGATCGGCTGCTGTCGTCCTACGACATCGACGCCGGCGACGAATCCAGGCTCTGGATCATCACTGAGGCAG 

ACCGCAGCTCAACCACGCTTTTGCTCCCTAGCGATTACTGA 

Protein sequences (SEQ ID NOs 165) 

MALMFPRLARNFARNGYFPTDEVTLERALQALTIiAPSGRMRICDPCAGEGVALAEAAHTLGRDQVQAIiAVEYDREI^ 

LDRVLHSDLFDTMISRQSFGLLWLNPPYGDLVADHSGASQYQGSGRRRLEKAFYQRCLPLLQYGGVMVLIVPHYVLDDELTGW 

LSNHFTGLRIYA7UUDPTFKQWIFGIRVRRQDLARADANQVRSRLQAIGAGQEKAEEIPAAWPWEPYWLPATSELEHFYRVT 

LEPEQFAGEIQRLRGLWPDFNLHFAQAGLQPRPPVRELSRWHLAIiALAAGAISGVWSKSGRILWKGDTYKDKVRKTEFTED 

DDGNITEVRILTDRFIPIIRAWEMTPSSVNQGRVLTISSSAATTEEAEEPQPEPAPAPAPLLISPGRWMTAAVSHLVETGQL 

NPAPLLKRHLAGDWGTLDQEDWNTNQRALKFGDRLLSSYDIDAGDESRLWI ITEADRSSTTLLLPSDY . 

RL065 

DNA sequence: (SEQ ID NO: 44) 

ATGCCCAGTCCCACCCCGCTCTACCAGATCGAAGAGTGTCCAGACCTGTACGTCGACGCCTGCGTGTGCGACGAGCAGTGCAA 
CCTGGTCTTTCTTTCGGCCTGGGGCCGCGACACCGTGACACAAGAGTTCCTGGCCAGGCTGACGCTGGGCCGGGAAGAAAATG 
GCATCGACCATTTCCACATCATCGTGGACGGCCGCCGCTTACCTGTCTTCCCAAACCAGGATCTCCTGGAGAAACGCACCACC 
CGTCAGTTCCGCGGCACGTTGTTCGGCAGCCTGCTCAATCTTTGGCTGTTCGATCGGCGCGCCTCGGCGCCCGACCGAGGCAA 
TCACCTCGCCTTCGCACTCCTGCAGCGCGATGAGGATCCACACCAGAGGCTCTGGCCGCTGGTGATGGAAACCTGTCCGCTCC 
CCCTCCTGCAGCACTGGCGCGAGCCGGTGATGGAGGTTCTCACCCAGCACCAGATGTTGACGGCCCTACCCGGGACGATCGGC 
AACGTCTGCGCCTGGCGACTCGCCCTGCGGGTCGACGTGCTTGAGCCCACCCTCGGTGAGGTAATCCGCGAAAGCATTCTTAC 
CACCGATGCTCAGGCGCAAGCCTGA 

Protein sequence: (SEQ ID NO: 166) 

MPSPTPLYQIEECPDLYVDACVCDEQCNLVFLSAWGRDTVTQEFLARLTLGREENGIDHFHIIVDGRRLPVFPNQDLLEKRTT 
RQFRGTLFGSLLNLWLFDRRASAPDRGNHLAFALLQRDEDPHQRLWPLVMETCPLPLLQHWREPVMEVLTQHQMLTALPGTIG 
NVCAWRLALRVDVLEPTLGEVIRESILTTDAQAQA 
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RL066 

DNA sequence: (SEQ ID NO: 45) 

ATGAATCCATTGTTCACCAACCTCACCCAGGAAACCCTCGCCTACCTCGAGGACCAACTGTCCAACAACGACGTCGCCGGCGA 
CGACGAGCTCATCGACTTGTTCATCGAGGAGCTGTCGCTGACCTTGGAGCAGGCGGAAGCGGCTGTCGCGCTACGCGATCAGT 
ACCTCTGCCAGGTCTTCCTGATCGGCCAAGGGCCGCTGCACCAAGCCGATGGACT.CAGCTTCGACCCTCACACCAAGAGCGTT 
CGGTAG 

Protein sequence: (SEQ ID NO: 167) 

MNPLFTNLTQETLAYLEDQLSNNDVAGDDELIDLFIEELSLTLEQAEAAVALRDQYLCQVFLIGQGPLHQADGLSFDPHTKSV 
R 

RL067 

DNA sequence: (SEQ ID NO: 46) 

ATGGGATGGCTTTTCTCACATCAGACGAAGGAAGACCTGCTGCGTGAGCTGCTGGCCCCAACCAGTACCTTCGCAGGCAGCAC 
CGAGGTGCTGGCACACGCAGTCTCCGGCAATGAACTTTGGACTGTCGTAAAACGAACTTTTCACCTTGCCGGATTCTATTTCG 
GCAAGCCGGCCGGTCACTCGATCACC7^TGATCGAGCTGCACTTGCTGGACTGCTCGGCCGGGCAAT<3G(3GCTACAAGACCATT 
CCGGAAAGCGCCGGCCCGTTCTACTACGGCrrGTCCGCTGGAGTTCCTGGACCTGGCTCACGATGAGATCAACCAGGAATGGCG 
TAAACGCCTGACGCACGAACACCAAGCCTGA 

Protein sequence: (SEQ ID NO: 168) 

MGWLFSHQTKEDLLRELLAPTSTFAGSTEVLAHAVSGNELWTWKRTFHLAGFYFGKPAGHSITMIELHLLDCSAGQWGYKTI 
PESAGPFYYGCPLEFLDLAHDEINQEWRKRLTHEHQA 

RL06 8 

DNA sequence: (SEQ ID NO: 47) 

ATGAAATCGATCTACAACACCCCAGGCTTCAGCGAGGAGTTGTTGCTGGTTTGCGCCTCGCTGCGCGAGGTCGGACTGGACAA 
TCTGGCTGACCAGTTCCGCGCGGCAGTGTTCGACCGATCCGTCGTCGACCAGGCCATCATCGCACTGCGTGAGCGGGTGAAGA 
CCCCTTCGCCGGAGCATGCGGCCGACAACGAGCCCTGGTTGTACTGCGACTGGCAGGCCAGGCAAACAGCTTACCGGCTCCTC 
CAGCGCCTTGAGCGCGCAACACGCTGA 

Protein sequence: (SEQ ID NOs 169) 

MKSIYNTPGFSEELLLVCASLREVGLDNLADQFRAAVFDRSWDQAIIALRERVKTPSPEHAADNEPWLYCDWQARQTAYRLL 
QRLERATR 

RIi069 

DNA sequence: (SEQ ID NO: 48) 

ATCCCCTCACCACGATCCCGCTTCGGCGGGATCATCCTTTTCGCAGGTCATACCATGATCACAGTTCCCGGACAGTTGGCCAT 
TCGAACCATCAACGGTCGCTATGGCGAGTTCAATGTGGGAAAACTCTGGACTTCGATCGGGGAGTTCATCATCAAGGATGCCT 
TCCTGGATCAACACACCGAAGGCAAGTACCGCGGTGATTTCGTCATCGCCAATATCCGCCCCCACCACTACTCCGCCGGCGGT 
CGGCTAGTCATCGAGATCCGCGCCATAGTGGACAGCATGACGCTGAACGATATGGACAGCCTCAGCGACGAGGAGGTAGAGCG 
TCTTTCCGGCAATGAGGTGGATCCGCTCGACGAAGTGCCCGAGATCCAGCTCCCCACAGTAGTACCGGCGATACCACCAAAGT 
CGCCGTCACCCCAGAAGTCGAAGCCTCTGTGCCTCGCTGCAACCAGGGACGCGCCTTTCGGTATGGACACTCCGGCTCCTGCA 
GAGCAGGCCGCCTCTCTGGACACAGACGCGGATGCAGAACTGTTCGGGACGGTCTGGCCGCTAGGCGAAATCGTCAAGCTGGA 
CACCACGGTCGACCGCAAGCGACTACGCCAACAGTGCGTGCGACTCGGCGCGCTGGGCTATGAGCTCGACTTCAAACAACAGG 
TGTGGACCCGCAAGGAGGCCGCATGA 

Protein sequence: (SEQ ID NOx 170) 

IPSPRSRFGGIILFAGHTMITVPGQLAIRTINGRYGEFNVGKLWTSIGEFIIKDAFLDQHTEGKYRGDFVIANIRPHHYSAGG 
RLVI EI RAI VDSMTLNDMDS LSDEEVERLSGNEVDPLDEVPE IQLPTWPAI P 

RI.070 

DNA sequences (SEQ ID NOt 49) 

ATGACCTCTCTCAACAACCACTCCAGCGCAGGTCACACTGCTGCGTACCTCAAACTCCCGATCGTTCTCACCAACGCGGCCTG 
GCTGCGCCTGGTCTATCTCGCCAACCCTGCCAGGGTCGACGAGATGGGCACCCGGCTGGCCAGTGTCGTTCAAACCGCCTGGC 
AGGAGCTTTCTCTCCAGCCGACCGCGAAGCACATCCAATTCCACCTGTACCACAAGGAGGAAGAC3GGGCAGGACCGCGCGCTC 
GCGCTGCTGGTTCTCTCGATAGTCGAGCCGTCCGATC3AGCCTTCCTACCTGCGCATCGAGTTGCAGGAAGAGTGCCTCGCCGA 
ACACCCGGTTACCGAGTAG 
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Protein sequence: (SEQ ID NO: 171) 

MTSLNNHSSAGHTAAYLKLPIVLTNAAWLRLVYLANPARVDEMGTRLASWQTAWQELSLQPTAKHIQFHLYH^ 
ALLVLSIVEPSDEPSYLRIELQEECLAEHPVTE . PKSPSPQKSKPLCLAATRDAPFGMDTPAPAEQAASLDTDADAELFGTVW 
PLGEIVKIiDTTVDRKRLRQQCVRLGALGYELDFKQQVWTRKEAA 

RL071 

DNA sequence: (SEQ ID NO: 50) 

ATGACTCAACTCAACCCGTTTATTCGCGGCTATGAGAGTTTCCGCATCGAGCGAAACCTGCAGATCACTGACGAAGGCAACAA 
TCTACCGTGCTACCGCGCTCTGCATGAAACCCAGCAGCACCTCCCAGACGAATATTTTCAGTGCGAGCTGTGCTACTTCAATA 
ACGATTTCGCCGTGGTAGTCCAAGAGTTAGACGATGAAAGAGTTGAAAAATGCCCTCACCAAGGAATAGTGAGAAACGTACTT 
TACAGCATCTACGGTGAGCAGGACGGCAGAAAAAAGCTTATCGGAGATCAATACTCACTGACCGAAGCCGAGAGTGTCGTTCG 
ATACCTTTCGTTCGGCGGCGGTTATAACCCCTGCTGGGAGATCAGAAAAACACATCTACCCATCAGCGCGTGGAATAGCCTCT 
ACGAAAGGTTCTCGACCAAGATGCCAATCCGCTTGCCCTCGGTGTTGGTATCGCTCTTCTGGTGTAACGAGCACGGTGCCGTG 
GGCTTTCGCTTGCACAACACCCCTTGGACGGATGAGTGTCTGGAGATCCTGGAGATGACCGCAGCCGCTCTTCGACAAGAACA 
GCTTGCCTTCGGCCTCGACGAACACCTTGTCGATCTGCTTCACCTCGCGGGACAAGCAGACATTCGGCTCCTGGTACTTGATC 
CATTCGCGCCCACGCTCAAGGGCCTGCCGCTTTATGACGATTGA 

Protein sequence (SEQ ID NO: 172) 

MTQLNPFIRGYESFRIERNLQITDEGNNLPCYRALHETQQHLPDEYFQCELCYFNNDFAVWQELDDERVEKCPHQGIVRNVL 
YSIYGEQDGRKKLIGDQYSLTEAESWRYLSFGGGYNPCWEIRKTHLPISAWNSLYERFSTKMPIRLPSVLVSLFWCNEHGAV 
GFRLHNTPWTDECLEI LEMTAAALRQEQLAFGLDEHLVDLLHLAGQADIRLLVLDPFAPTLKGLPLYDD . 

RL072 

DNA sequence: (SEQ ID NO: 51) 

ATGGGACTGGTGTTTCCTACCGAAAGGAGAATCACCATGCAATACGGAAAGCTGGCGCTCGCCCATCTCAGCCTGGAACTGCC 
GTTGCAGGTACTTATGAATAAGAACCGTGCTTACTACATCGGCACTTCTGACGAAGAAGGACCAGCCTCGCGCGAGTCGGTTG 
AATATTACCCCTCACGCGAACTTGCCCAACAGGCATTAGACCACGGCACTTGGACGCAACTGGAATATTAA 

Protein sequence: (SEQ ID NO: 173) 

MGLVFPTERRITMQYGKLALAHLSLELPLQVLMNKNRAYYIGTSDEEGPASRESVEYYPSREIjAQQALDHGTWTQLEY. 
RI.073 

DNA sequence: (SEQ ID NO: 52) 

ATGGGAAATGTTTGGCGATTATGCCAGGGCAGATACCTGGGCATTGTTGTT(3GCCAGGAACAGCCAGGCGAAGTTGCAGAACT 
GACTGCTGAGCAGCAGCTCGTCCTCGACGTCGCTGAGGCTAACCTCCTCAACTTCCGGCAGGGCGC3GCAGTTCTACGATTTGG 
ATGTTGCTCATGATGATCTCCAGATAATGGAC3AACACCACGCCCTGGGGGC3AGATGGTGCCCCCCGGATGGGTATGCGATGAA 
GAGTGGCGCATAGCGTAG 

Protein sequence: (SEQ ID NO: 174) 

MGNVWRLCQGRYLGIWGQEQPGEVAELTAEQQLVLDVAEANLLNFRQGGQFYDLDVAHDDLQIMENTTPWGEMVPPGWVCDE 
EWRIA. 

RL074 

DNA sequence: (SEQ ID NO: 53) 

CTGACGGGCAAGGTGTTTCTCCGCTTTCGCTTACGAAACTGGAGAATCATCATGAGCAACAACACCCAAGCCCAAGAAGCCAA 
GTATTTCGACCTGCACACCACCGGTATCGGCTACCTCAATCGCATCCGCGAGGTACCGATCCGCCGAGGTGAACCATTCCTCG 
CCGTAACCGTCGCAGCCCTCCATGGCGCGGCAGACAGCGTGGAATACTCCTACATCGACTGCAAAGTGGTCGGCGCCCAGGCT 
GAAAAGCTTGTCCGCCGTTGCAAGGAAGCAGTCGAGGCCAAGAAGAAGGTTCTGATTTCCTTCCGTATCGGCGATATCTGGGC 
GGATCCCTTCATCCACCAGAAAGGCGAGAAACAAGGCAAGCCCGACGCAAGCCTCAAAGGCCGGCTGCTCTTCATCTCCTGGA 

tcaaagtggatggcacc::accgtctacgatgcgaaggaagaagctgaaaaagcccagcaaggcaaaggcgaacctcaaggtgag 
cccgcagcccccgctgagcacgctgaacaagccgctgcttga 

Protein sequence: (SEQ ID NO: 175) 

ltgkvflrfrlrnwriimsnntqaqeakyfdlhttgigyustrirevpirrgepflavwaalhgaadsveysyidckw 
eklvrrckeaveakkkvlisfrigdiwadpfihqkgekqgkpdaslkgrllfiswikvdgttvydakeeaekaq 
paapaehaeqaaa . 
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RL075 

DNA sequence: (SEQ ZD NO: 54) 

ATGTCCAAGCAATCCACCAGCTTCGAAATCGGCTTTGCCCTCGGCAGTGTTGTGCGTGAGTTCCGCAGAGCGCTCAGTCGCCC 
TCCGGTCGTAGTGCAAGCACAAGCGCCGGTTGCGTTGAGAGTCCAGCGCATCGATCCTGCCTTCCTGGCCGGCCCGACCGCTG 
GCGAGCTAGAACACATCAGCGACATCCCAGCCATCGTCCGGCTGAAGAAGGTCAACCTGAATGACTGGTATCTAGCCAATACG 
CGCGAGGTGCAAAAGCCCAAGCGCGCACGCAAACCCAAGCCGGCCAAGGCGACCGCCAAAGCTGAAACGCCAGTCAGGAAGGA 
GCTCAAGATGGGTTCCCTCGACCATTTGATTGCACCCAACTCCGAAAGCGAAATGGGGAGGCCCCCTCTCCAGTTAGAGTCCC 
TGAACGATCATGAGATTGCTCTTTTGCCAGCACCTCCTGGTAGCGCAGTCTCTTGGGAACTCCATCGGCGTACTCAGGAGCAA 
TACCAACAACGCTGGCAGGACTACTTGTCCACCATGACGGATGAACAAGTAGCTGCTCTCGGCCGCTAA 

Protein sequence: (SEQ ID NO: 176) 

MSKQSTSFEIGFALGSWREFRRALSRPPVWQAQAPV7VLRVQRIDPAFLAGPTAGELEHISDIPAIVRLKK^^^JL^rDWYIJ^ 
REVQKPKRARKPKPAKATAKAETPVRKELKMGSLDHLIAPNSESEMGRPPIiQLESLNDHEIALLPAPPGSAVSWELHRRTQEQ 
YQQRWQDYLSTMTDEQVAALGR . 

RL076 

DNA secpience: (SEQ ID NO: 55) 

ATGGTGTTTCTCCTGCAGGTTGAGGGTGCGGAGAAAACACTGGCCCTGGCGGGGAAGTGGATTCCCCGCTGGGTTGCGGAAGG 
GAGCTTCTATCGACCGAGGCCGACCGACCGCGCTACCAGAAGCTATGCGGTCTTGGGTTGGATCAACAGGGTGGGCTGTGCTG 
CAGCATTTCGGATCCGAGCTGCATGGGGGCATGTCGCTGACAACGTCAGCAGATCACGCGTTCATCATCGAAGCGGGGGGCGA 
AAGTGTCAAGGTCAAGCAGGAGGGGGAGCGGATGCAGCGGGCGGAGAGCGAGGGCGGAAGAGCGCGGCTGGTAGAAACCCTGT 
CAAAGGTTTCCCCAGCCGTGTCTGGAAGGGGAGTCAAGTGAGCCACCTGTGGTTGAATCGTCGATCCCTGGGCATTGATCGTC 
TCGATCCCATCACCCGGCCATTATCGTGGCTTGGCCAGCAAACAGTAGGCACGCATCCGCGTACAAAGGGAGCCCTGCGTATC 
ACCGGCGGGCCACCGGCAGGGAGAAGGATCCCGATGGGTAGCCTGATAGTCCTGGAGCAGGAGCATCAGGCTACCCATGGAGA 
GGGGAAAAGGAGGGGCCGTAACACCAGTACGACCCTTAAATCGAGGAAACACCGAACCTCTTGA 

Protein sequence: (SEQ ID NO: 177) 

^^FLLQVEGAE KTKAL AGKW I PR WAEGS FYRP RPTDRATRS YAVLGW INTVGCAAAFRI RAAWGHVADOT 
KCQGQAGGGADAAGGERGRKSAAGRNPVKGFPSRVWKGSQVSHLWLNRRSLGIDRLDPITRPLSWLGQQTVGTHPRTKGALRI 
TGGPPAGRRI PMGSLI VLEQEHQATHGEGKRRGRNTSTTLKSRKHRTS . 

RL077 

DNA sequence: (SEQ ID NO: 56) 

ATGCCGCTGATGTGGATCGTCCrrGGTGCTCGCGCTCATCACCGGGACCTGGCTGAGTGTACAAAGCGACCACGCGACCTCGAG 
CGCCGAACTGGCCGAGGTCGACACCCTGGCCAGGAGCTTGCTGCTCTTCCGGTCCAGTCTGGCGGAGTACGCACACGCCAACC 
CCGGTTTCACCGGTTCGCCGGCGGACTCCGCTCTTGGTTTACCGGCCTGGTTCCGCAAGCCAGCGCGGCTTCAGGGCTACATC 

gccgccggc7vccagctacgccttcatcgccrcgccgccggcggc3gctggcggcggccgtggatgctggtac(3gaatc 

ggttggcgtcaggcgcaacggccagttagtcacgcgccgcctcggagccactgtcattgcgctccctacgcccatccccgagg 

gcgcggtggtcgcggtc:aaataa 

Protein sequence: (SEQ ID NO: 17 8) 

MPLMWI VLVLAIi I TGTWLS VQSDHATS SAELAEVDTLARS LLLFRS S LAE YAHANPGFTGS PADS ALGLPAWFRKPARL^ 
AAGTS YAFI AS PPAGLAAAVDAGTESDLVGVRRNGQLVTRRLGATVI ALPTPI PEGAWAVK . 

RL07 8 

DNA sequence: (SEQ ID NO: 57) 

ATGAGGAGTACGCGCAGCAGTGGATTCATCTCGATCGAACTGATGATCGCCCTCGTCGTGATCGCCATCGCGACCGCCGGTGG 
CATATCGGTCCTGATGAGCTACCT(3GACGGCTTGGACGAGCAGCACGCGGCCCAGCAGCAACAGCAGGTGGCCAAGGCAGCGG 
AGAAGTACCTGAAGGACAACTTCAGCACGGTTCTGGCCAGCGCCGGCGCCACGGCCCCGGCGGTGATCACCGTCCCGATGCTG 
CGCAACACCCGTTACCTGCCCGCAGGCTTCCGCGACACCAACATCTACGGCCAGCAATACCAGGTCCTGGCCCGCAAGCCGGC 
GGCCAACCAGCTCGAAACGCTGATCGTGACCACGGGTGGACAGGTAGCTTCCGAACTCTCGATCCGCCGGATCGCGCAGCTCA 
TGGGAGCCACCGGGGGCTACATCTCGAAAACCAACACCAGTATCGCCCAGGGCGCCGCCTGGCAGGTGGCCTTAAGCAATTTC 
GGTAGCGCTCCCGGCGCTGGACATCTGGCGACGGCGCTGTTCTTCCAGGACGGCGCCATCGCCAACGAGTACCTCTACCGCAA 
TGCCGTCCCGGGTCATCCTGAACTCAACCGGATGAATACCACGCTGGACATGGGAGGCAACAATATCGCCGCAGCCGGGGCGA 
TCACGGCCAGCGGCAACATCACCACCAGCC3CGGACATCAGCGCGCGCAACGTGACAGCCACTGGTACGGTGAAAGCCGGCACT 
GCTGACGTCGCCGGCGAGACGTACACCGGAGGCTGGTTCAGGACCCGTGGTGACACGGGCTGGTACAACGAGAAATGGGGCGG 
CGGCTGGTACATGAGCGACAGCACCTGGGTGCGCTCCTGGATGAACAAGAACGTCTACACCGGCGGCGAGATGATVAGCGGGCA 
AACTCACCGCCGAGGGCCGGACGGAAGTCGGCGAGTACCTACAGCTCAAAGGCGTGGCCACCGAAGGAGCCAACTGCTCGCCG 
AACGGGCTGGCAGGCATCACCAGCACCGGACTCTGGCTGTCCTGCCAAAACGGGAAATGGGGACGAACCGCCGCCTCCATGCG 
CCTGAACACCACGGCCGGCGTGATCAAGGACTGGTGTACGTTGCATGGTCAGGATAGCGCCATGGTGAACTACGACTACGTCC 
GCTACGCGATCACCTGCGGCGGCCGATTCTGCGCAGTGGGCTTCAACCAGACATTTGGCACCAACTACTCGTTCGGGCTAATC 
ACTGAGATCGGCCCAGGCTTCAACTACCCGGAACCCTACAAGACCCCCGACTCGACCAACGTGACCGTTACCTGCGTGAACTA 
G 
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Protein sequence: (SEQ ID NO; 179) 

MRSTRSSGFISIELMIALWIAIATAGGISVLMSYLDGLDEQHAAQQQQQVAKAAEKYLKDNFSTVLASAGATAPAVITVPML 
RNTRYL PAGFRDTN I YGQQYQVLARKPAANQLETL I VTTGGQVASELS I RR I AQLMGATGGY I S KTNTS I AQGAAWQVALSNF 
GS APGAGHI^TALF FQDGAI ANE YLYRNAVPGH PEIJ^RMNTTLDMGGNNI AAAGAI TASGNi TTS AD I S ARN^ 
ADVAGETYTGGWFRTRGDTGWYNEKWGGGWYMSDSTWRSWMNKNVYTGGEMKAGKLTAEGRTEVGEYLQLKGVATEG^ 
NGLAGI TSTGLWLS CQNGKWGRTAASMRLNTTAGVI KDWCTLHGQDSAMVNYDYVRYAITCGGRFCAVGFNQTFGTNYSFGLI 
TEIGPGFNYPEPYKTPDSTNVTVTCVN. 

RL07 9 

DNA sequence: (SEQ ID NO: 58) 

GTGAGTGTGAACCCGATCATCCAGGCTCAGTTCGTCGACCTCTACCTCGGTGAAGGCTTCGCCGACGTGT^AAGGCCTGGCCGG 
CGCCGGCGCGCGCCGAGTCGAAGTGCCTCGCGAGTGGGAGTCGCACGTCCAGGAACTGCTCCAGATCTGCAGGCAAACGCTGG 
AGGAGCTGCAGGACCCTGAGTTCGCCATCGTCGTCGACGGCGTTCTGCTTCGCGTCACCCTCCTCGAAGACGCTTTCAGTGGC 
AGCGTCTTCGTGCTGCGCCGGTCGAGCGCCCAATTGCGGGAGTTCCAAGAGATCGGCTATCCGAGCGAAGTGGTTTCCGCACT 
GATGGATCCGCAGTTGCAGGGCCTGGTCCTGTTCTGCGGCGAGATGGCGACAGGCAAGACCAGCTCCGCCGCCTCTCTGCTCC 
TGGCCCGCCTGCAGGAGTTGGGCGGGGTGGGCTGCGCCGTCGAGGACCCGCAGGAAACCAACCTCAGCGGTCAGCATGGGCTC 
GGCCGCTGCATCCAGGTCAGAACCTCACGGCGCTCAGGCGGATACAGCGAGGCCCTGCTGCGCACGCTGCGGGCCGGCGCCGA 
CCTGGTGCTGATTGGCGAGATCCGCGACGAGGACACCGCCTACCAGGCCTGCAAGGCCTCTCTGACCGGCAGCCTGGTGATCG 
CCACCATTCACGCGAAAAGCTGTCATCAGGCGATCGAGCGCTTGGTGACGCTCGCCCAGCCACTGGCGAGAAACGCCTACGAC 
GTGGTTGCCGAAGGCATCCAAGCTGTGATCTGCCAAGCGCTGGAGAGCGATGGTTCCTCGCGCCGCCTGACCGCCGAGCCACT 
GCTGTTCACTGGCGACGACGGCCCGTCCATGCGCGACAAGATCCGCCGAAAGGAGGCTCATCTGCTGCAGGACGACCAAGCTC 
GCCAGTCCCGGCAAAGCCTATGGAGATAA 

Protein sequence: (SEQ ID NO: 180) 

VSVNPIIQAQFVDLYLGEGFADVKGLAGAGARRVEVPREWESHVQELLQICRQTLEELQDPEFAIWDGVLLRVTLLEDAFSG 
SVFVLRRSSAQLREFQEIGYPSEVVSALMDPQLQGLVLFCGEMATGKTSSAASLLLARLQELGGVGCAVEDPQETNLSGQHGL 
GRCIQVRTSRRSGGYSEALLRTLRAGADLVLIGEIRDEDTAYQACKASLTGSLVIATIHAKSCHQAIERLVTLAQPLARNAYD 
WAEGIQAVICQALESDGSSRRLTAEPLLFTGDDGPSMRDKIRRKEAHLLQDDQARQSRQSLWR. 

RL080 

DNA sequence: (SEQ ID NO: 59) 

ATGAGCACTACGCAACGCACTTCCCGTCCGACGCAGGGCGGTTTCGTTTCCATCGAGATGATCATCGTGCTGATCATCATCGC 
CATCGGGGTCGGCCTGGGCCTGGCCGCAGCGGCTGGAATGTTCAGTTCGTCCAACGCCAACGAGGAACAACGCAACATCAGCG 
TCATTGCGGCCAAC<K:ACGCGCCCTGAAGACCTCTTCG(3GCTACGGCTCCAGCGGTACCAACCTGATCCCCAGCCTGATCGCA 
ATCAACGGCGTGCCGAAGAACATGAGTGTCTCCTCCGGCGTCGTCTACAACGTCTACC3GCGGATCGGTCACTGTCTCGTCCAC 

cggcatgggcttctcgatcaccaccagcaagttgccccaggacgcctgtatcacgctggccaccaagatcgcgaagaacacct 
tcgagcagaccaaaatcaacagcggatcctcgatcaccggagaagtc3accaccgcagccgcgacccaggcctgcagcagcgac 
agcaacagcattacctggacctatagttcgtga 

Protein sequence: (SEQ ID NO: 181) 

msttqrtsrptqggfvsiemiivliiiaigvglglaaaagmfsssnaneeqrnisviaanaralktssgygssgtnlipslia 
ingvpknmsvs sgvvynvyggsvtvsstgmgfs itts kl pqdaci tlatki akntfeqtkinsgs s i tgevttaaatqacs sd 

SNSITWTYSS 
RL081 

DNA sequence: (SEQ ID NO: 60) 

ATGGGGGGCTTCTGC3GAGCAGTTGCA6TTCGCCTTCTACAGCAAGCAGTTCGGCCGCAAGGAACGCCTGCAGTTCTACGAAAG 
CATGTCCACCCTGCTCGAAAACGGGGTCCCGTTGAAC3GATGCTGTC3GCAGAGGTGCAC1AA<3ATCTTCGCTCATGAG<3(3GCAGC 
ATCCGTTTCATCCGGTGGCCATCGCCAGTCGCGAAGCGCTGATGGGGCTGTCCAACGGCAAGCGTCTGGCCACCGCCATGGCG 
CTCTATCTCCCCGCCCAGGAGCGAGCGTTGATCGAGGCC(3GCGAGATGAGCGGCAACCT<3GTTCAC3GCCATGGGCGATGCCGT 
CTCCCTGGTCGAGGCCCAC3GCCAGGATCCGCGCCACCATCTGGCAGGCGCTGCTCTACCCCTCGGCGCTGTCCGCCATGATGG 
TGTTCCTGCTGTGCATCGTGGCCTATCGCATGGTCCCCAGCCTGGCCAGGCTCTCCGACCCAGTCACCTGGACCGGCCCC3CTC 
GCCACGCTCAACGCCATTGCCAGCTTCGTCACAGGACCTGGTATCTACGTTCTGGTCGCCGTCATCACCCTCACGGTGGTGGT 
CATCGTCACGTTGCCGACCTACCGCTGGAAAGGCCGGGTCTGGCTGGACCGGACGCTGCCGCCCTGGTCCATCTACCGCATGC 
TCCAGGGCACCACCTTCCTGCTGAACATGGCGGTCATGCTCAACGCCGGCATACGCCCCTACGACAGCCTGGCCAGCATGATC 
AAGATCTCCCCGCCCTGGCTGAAGCAGCGCTTGGAAGCTGCCCGCTACGGCGTGGGCCTGGGCCAGAACTTGGGTGTTGCCCT 
TCGCAGCGCCGGTCACGATTTCCCCGACCGGCAGGCCATCCAGTACCTGTGCATCCTCGCCAACCGGGGAGGCTTCTCCGAGG 
CGCTGGTCAAGTTCAGCCGCCGCTGGCAGGAGACCAGCCTCAAGCAGATCGAGCTGGCCGCCGGGCTGGTGAAGAACTTCGCC 
CTGATCTTCATCGGCGCGCTGATGATCCTGGTCCTGCTCGGCGCCTACCAGGCACAGCAGCTCATCCAATCCATGAACCACTG 
A 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme at al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 54 of 118 Customer No.: 21 559 



55/118 

FIG. 30J 



Protein sequences (SEQ ID NO: 182) 

MGGFWEQLQFAFYSKQFGRKERLQFYESMSTLLENGVPLKDAVAEVHKIFAHEGQHPFHPVAIASREALMGLSNGKRLATAIV^ 
LYLPAQERALIEAGEMSGNLVQAMGDAVSLVEAQARIRATIWQALLYPSALSAMMVFLLCIVAYRIWPSLARLSDPVTWTGPL 
ATLNAIASFVTGPGIYVLVAVITLTVWIVTLPTYRWKGRWLDRTLPPWSIYRMLQGTTFLLNMAVMLNAGIRPTO 
KISPPWLKQRLEAARYGVGLGQNLGVALRSAGHDFPDRQAIQYLCILA^rRGGFSEALVKPSRRWQETSLKQIELAAGLVKN 
LIFIGALMILVLLGAYQAQQLIQSMNH . 

RL082 

DNA sequence: (SEQ ID NO: 61) 

ATGACGAACCTTCAGATTGCCGCGCTTGCGCAGCCCTCCATGGTGACCCAACTGCTCACCGCCGACGGTGGTGAATGGGAGGT 
ATCGAAGCACCTGCAGGAAATCATGGCTCTGGCTGCCGACGGCACGCTCTATCTATCGGAGAGCCACCAGAACGACATACACG 
TTCTGTCGTTCATCGACCGTCTCGATCGCCGTGGCTTCCGATACCAGCTCAACCTCACCGACCTGCAGACCATTCACCAGCTT 
TACCGCGCCGTCGCCATGGACGGCCTGGTCGATAGCGATGGCCAGCGCGCCACCCAGATGCAGGAGCGCGTGGTCAAGATCAT 
TCGTAAGGCCACTGAGCTGCGCGCCAGTGACGTGCATTTCGTCGTGAGTCCCGCCGGCACCGGCAGCAAGATCCGTTTCCGCG 
TCGACGGCCTGCTGAAGACCGTCGAGCAGTTCCGCAGCCAGGAGCTGCACGAACTCTGTGCAACCATCTACCAATCCATGTGC 
GACGTGGCCGAGCCACTGTTCAAGCCGCAACTGGACCAGGACGCGCGGATGAGCCAGACCTTCGTCGAGAAGCTCAACCTGTT 
CAGTGCCCGGATCGCCACCCGCCCGCGTGCCGGGGGGTTCCTGATGATCCTGCGACTGCTCTACGACGACACCGGCCTCGACA 
GCCTGGAGCAGCTCGGCTACCTGCCCGAGCAGAACGCACTGTTCGATCGCATGATGCGTATGCCCTACGGCATCAACATCCTG 
TCCGGCCCCACCGGGTCAGGAAAGTCGATGACCTTGAAGGTCACCCTGGAAGGCCTCGACAAGCTCCATGGCGGATCCAAGCA 
CATCCTGACCATCGAGGATCCGCCGGAATACCGCATTCGCGGCGAAGGCATCAACCAGACCCCACTGGTCTACGACGCCACCG 
ACCCAGACGCAGAACGCCAGGCCTGGGCCGCGGGCATCGCCAACGGCATGCGCCTGGATCCGGACTACATGATGATCGGCGAA 
GTACGCGACCTCTTCGCCGCTGTCGCCGCGTTCCGTGGTGCGATGACCGGGCACGGCCTATGGTCGACCCTGCACACCAACAG 
CGCGATCGGCATTGTCCAGCGCCTGAAGGACCTGGGCGTCGACCCCGGCTTGCTGTTCGATCCGGCCCTGCTGACCGGCCTGA 
TCAACCAGAGCCTGCTGCCCAAGCTCTGCCCCCACTGCAAAGTGCGCTTCCAAGACCACCAAGACCAACTCGCGCCCGACTTG 
GTCGAACGGGTCCGACGCTTGACCGATGTTTCCCAGGTTCACGTCAAGGGGCCTGGCTGCCAGGCCTGCCGTGGCTCCGGGGT 
CAACGGCCGCTCGATCGTCGCCGAGGTGGTTCTGCCCACCCTCGCCTTCATGCGTGTGTTCGCCAAAGGCGGCCCAGCCGAGG 
CACGCAACTACTGGGTCAAGACCATGCAGGGCATCACCAAGCACGCCCACGCCATCCGCCGCATCAACGAGGGCATGTTCGAC 
CCGCAGATGGTCGAGGATTTCATTGGGCCACTCGACTTCGATGAGCATCTGCTCGACGACAGCTTCTACTCGCAGGAGGCGTG 
CTGA: 

Protein sequences (SEQ ID NOs 183) 

MTNLQIAALAQPS^WTQLLTADGGEWEVSKHLQEIMAIlAADGTLYLSESHQNDIHVLSFIDRLDRRGFRYQLNL^ 

YRAVAMDGLVDSDGQRATQMQERWKIIRKATELRASDVHFWSPAGTGSKIRFRVDGLLKTVEQFRSQELHELCATIYQSMC 

DVAEPLFKPQLDQDARMSQTFVEKLNLFSARIATRPRAGGFLMILRLLYDDTGLDSLEQLGYLPEQNALFDRMMRMPYGINIL 

SGPTGSGKSMTLKVTLEGLDKLHGGSKHILTIEDPPEYRIRGEGINQTPLVYDATDPDAERQAWAAGIANGMRLDPDYMMIGE 

VRDLFAAVAAFRGAMTGHGLWSTLHTNSAIGIVQRLKDLGVDPGLLFDPALLTGLINQSLLPKLCPHCKVRFQDHQDQLAPDL 

VERVRRLTDVSQVHVKGPGCQACRGSGVNGRSIVAEWLPTIJVFMRVFAKGGPAEARNYWVKTMQGITKHAH^ 

PQMVEDFIGPLDFDEHLLDDSFYSQEAC . 

RL083 

DNA sequence: (SEQ ID NO: 62) 

ATGCGAACTGAGCCGATCGGCATGGCCGTGGCCGTGCTCTTCCTCCTCGCGTCTGGCCAGGCCTGCGCTGGCACCGTTGGCGA 
ACTTGCGGAGATCCAGGCCCAGGCCATCCTCACCGAGGCCAAGGTGCGCCTGGCCACGGCGCAGCGGCAATTGGAAGGCAAAG 
GCGAAACCGGCCAGGTCGTCAGCGCCCAGGGGCAGACGTTCGCCATGCCGGTGCCGGCGGCGCCGCCGACGATCACGCAGCCG 
GTTCCGCCAGTGGTGCGGACCATCTACGGCGCCGGCGGCAAGATGACTGCCACGTTCTTGTTCCCGGGCGGGTACGAGGTTGA 
CGCCGCCAGCGGCGCGGAGCTGCCTGGCAAATACCGCGTCGAGTCAATCTCGCTGGACCAGGTCGTGCTCACCGACAAGGACG 
GCAACCGCGTGCCCGTGGGCTTCTCCAGCGTTGCCCCCACCCAAGCCTCCTCTACGGCCCAAGGCGCCTCGGTTCCGCCGGCG 
CTGCCCGGGGCTGTACCGCAGCCGTTCATTCAGTAG 

Protein sequence: (SEQ ID NOs 184) 

MRTEPIGMAVAVLFLLASGQACAGTVGELAEIQAQAILTEAKVRLATAQRQLEGKGET(3QVVSAQGQTFAMPVPAAPPTITQP 
VPPWRTIYGAGGKMTATFLFPGGYEVDAASCSAELPGKYRVESISLDQWLTDKDGNRVPVGFSSVAPTQASSTAQGASVPPA 
LPGAVPQPFIQ . 
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RL084 

DNA sequence: (SEQ ID NO: 63} 

ATGGAGAAGCCTGACCTCGGCAGCCGTGGACCAGACGTCTCGATCCTGAGCTACCACGGCAACAAGTTCGTCAGCGGCCTGTT 
CTGGCGGCCGCTGTCCAGCCAGCGGCAGTACATGAAGGAAGCGCGCAAGCTGGGCAAGGAAGAGCATCTGGACATCGTTGCCA 
TCCGCCATTCACCGACGGTGATCCAGGCCGGCTTCGTTTCGAAGTCGCAAGGCGCAGTCAAGGGGATGTACTCCCTGGCCTCG 
GCGCTTTCAGGCCAGTTCGACGGCGACTTCCTGGCCTGCTGGAAAGTCGACGAGGACCGCTACGCGCTGGTCGCCACGCTCGA 
TGGCGCGATTGTCCCCGGGCAGGATCTGGTCACCACCCTCGACGAGGCCCGGGACCGGGTCAGGAAGCTCTCTACGCGCGGCG 
TGCTGCGAAACGCACAGGTCTTCGTTCCCGAAGGGTTCGATTTCCCCGTCAAGGACTTCGACATCGAGGAACTGCTCGCGCCG 
AAGCGCCTGCGGCGCGACTACCGCCTCCGGCAACTCACCTTCGGCTTGTCCGCCAGGGAGTGGACGGCAGTGGCCCTGCTCGG 
TTGCGTGGTAGGTGGGTCGCTAACCGCCTACTACCTATGGAATGCCCACCAGGAAGAGCTCGCCAGGCAAGCCGCGCTCCTCG 
AGGAGCAGAGGCGCCTCGCCGAGCTGGCCGAGAAGAACGCCCAGGCCAAGCAGCCGCTGGACCTGGCGTCATTGCAGAAGCCT 
TGGACGCTCATACCTGACCTCGAGGACATGCTACGCGCCTGTAGCAAGGCAACGGGGGTACTGTCGCTGTCGATCCAGGGCTG 
GCTCTTCGAATCCAGCAAGTGCGACGGCAGGGTCCTGGTCGCCACCTACCACCGTACCGGCAACAGCACAGCAGCCGACCTGA 
CAGCGGCCAGCCAGCACCTGTTCGCCGACCGCCCCGCCTTCGTCATCGACAACGGCAACACCGCGGCCCTGAAGGTCGATCTG 
AAGGTTGCCATCGGCAGTGATGAGCCGCTACTGCCGGCGGACGACGTTCTGCAGGCGCTGACGAGCCACCTGTACCGTCAAGG 
GGTCGAGCCCAAGCTGTCGATCAGCCAGGAGACAACTCCGCCCCTCCCTGGCGCGGAAGCTGCGACTGAACAGCAAGTGGTGT 
TGCCTTCCTGGAAGAAATTCACCTTCAGCGCCCAGACCCGGCTCCCGGCAGACCTGACCTTCCAGGGGCTGCCCGCTGCCGGT 
GTCCGCATCACCAACCTCGAAACCACGCTCAAGGACAGCCAGTTGGACTGGACTGTCACAGGAGAAATCTATGCGAACTGA 

Protein sec[uence: (SEQ ID NO: 185) 

MEKPDLGSRGPDVSILSYHGNKFVSGLFWRPLSSQRQYMKEARKLGKEEHLDIVAIRHSPTVIQAGFVSKSQGAVKGMYSLAS 
ALSGQFDGDFLAOTKVDEDRYALVATLDGAIVPGQDLVTTLDEARDRVRKLSTRGVLRNAQVFVPEGFDFPVKDFDIEELIiAP 
KRLRRDYRLRQLTFGLSAREWTAVALLGCVVGGSLTAYYLWNAHQEELARQAALLEEQRRLAELAEKNAQAKQPLDLASLQKP 
WTLIPDLEDMLRACSKATGVLSLSIQGWLFESSKCDGRVLVATYHRTGNSTAADLT7VASQHLFADRPAFVIDNGNTAALKVDL 
KVAIGSDEPLLPADDVLQALTSHLYRQGVEPKIiSISQETTPPLPGAEAATEQQWLPSWKKFTFSAQTRLPADLTFQGLPAAG 
VRITNLETTLKDSQLDWTVTGEI YAN . 

RliOSS 

DNA sequence: (SEQ ZD NO: 64) 

ATCGTGTGCGAAGCTACGGCAGATTCCGCGTCTACGATCGCAGCGCAGGTGCGCAACACCCGACCGGATCGGCGCGATACGGT 
GGTGTTCTCCGACAAACCCTGGGTCAGCACGAAACCCCTAAGCGTTTCGCACACCTTGTCCAGTGACTGCATCGTGACGTGGC 
GCCCTGCAGGCGCAGCGTCGCTGCAGGAGGCCGCCCAGGAAGTCATCy^CCAATGCC^CATGGCGGTCAGTATCACGCCCGAC 
GCGCTGAACCCGGCCGCCTTCGCCGTGCAACCTCAGCAGCGCGCGAGCAACGCCCCGCCGCCCATCCAAGGCGGCCAGGACAT 
GGCCACCATGCTGTTTCCTGCCTCCGTCGCCAACGGCATGTCGCTCC3GTGCCGGCGGCAGCATGGGGTCGAGCTTCGC3GTCCT 
ACGGTCCGCGGTCTCTGTACAACATCAAATGGAACGGCAAAGTCAGCGGGTTCCTCGATCTCATCGCCGCCCGAGCCGGCGTG 
TCCTGGCGCTACAACCCAACCGAGAAAAGGGTCGAGTTCTACTACCTGGACACTCGGACCTTCCGCATGTACGCCTTCGACGA 
CGTCAACACGGTGGACTCCACCGTGCGTTCCGGTATGACGACGGCCGCCGGCATCAGCGGGGACGGCTCCGGATCCACCGGAC 
AGAATGGCAGCTCCGGCATCAGCGGCGACTCCGGCAGCAAGCAGACCACCAGCTCGGAGCTGAAGACATCGATCCTCAGCGAC 
ATCGAGAACAGCATCAACTCGATGCTGACGCCGAGCATGGGACGCATGTCGCTGTCGCGTGCCACGGGCACCCTGACCGTCAC 
CGACCGTCCAGAAGTCCTCAACCGTGTCCAGCAGTTGGTCAACCGAGAGAACGAGAGCATCACCAAGCAGGTGCTGCTGAACG 
TCAACGTGCTCTCGGTCGCCCTGACCGACAAGGATCAACTGGGGATCGACTGGAACCTGGTCTACAAGTCGCTCAACAACAAG 
TGGGGCATCGGCCTGAAGAACACCATGCCGGGCATCGATCAAAGCGCGATCTCCGGCTCCGTGAGCATCCTGGATACCGCCAA 
CAGCGCCTGGGCAGGATCCAAGGCCATGGTCCAGGCGCTGGCCCAGCAGGGCCGCGTCTCGACCGTCCGATCCCCGTCCGTGA 
CCACGCTCAACCTCCAGTCGGCGCCGATCCAGATCGGCCGCTACGACAGCTACCTGGCCTCCAGCCAGATCTCCAACGTCGCC 
CAGGTCGGCAGTACCACCTCGCTGATCCCGGGCGCCGTGACCAGCGGCTACAACATGAGCCTGCTGCCGTTCGTGATGGAAAG 
CGGCGAGATGCTGCTGAAGATCAACATCAACATGACCTCCCGGCCGACGTTCGAAATGCAGACCAGCGGGGACTCCAAAGCCC 
AGTTCCCGAGCTACGACATACAACTGTTCGACCAGAAGGTACGTCTGCGCAGCGGCGAGACCTTGGTACTCTCCGGCTTCGAC 
CAGACCACCGAGGACACCAACAAGGTCGGCACCGGCGACGCTGGGTTCTTCGGTCTTGGCGGCGGGCTGACCCGCAATACCAA 
GCGCGAGGTCATCGTGGTGCTGATCACCCCCGTCGTGCTGGGCTGA 

Protein sequence: (SEQ ZD NO: 186) 

IVCEATADSASTIAAQVRNTRPDRRDTWFSDKPWSTKPLSVSHTLSSDCIWWRPAGAASLQEAAQEVINQCHMAVSITPD 
ALNPAAFAVQPQQRASNAPPPIQC3GQDMATMLFPASVANGMSLGAGGSMGSSFGSYGPRSLYNIKWNGKVSGFLDLIAARAGV 
SWRYNPTEKRVEFYYLDTRTFRMYAFDDVNTVDSTVRSGMTTAAGISGDGSGSTC3QNGSSGISGDSGSKQTTSSELKTSILSD 
lENSINSMLTPSMGRMSLSRATGTLTVTDRPEVLNRVQQLVNRENESITKQVLLNVNVLSVALTDKDQL^ 

WGIGLKNTMPGIDQSAISGSVSILDTANSAWAGSKAMVQALAQQGRVSTVRSPSVTTLNLQSAPIQIGRYDSYLASSQISNVA 
QVGSTTSLIPGAVTSGYNMSLLPFVMESGEMLLKININMTSRPTFEMQTSGDSKAQFPSYDIQLFDQKVRLRSGETLVLSGFD 
QTTEDTNKVGTGDAGFFGIiGGGLTRNTKREVIWLITPWLG . 
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RL086 

DNA sequence: (SEQ ID NO: 65) 

ATGACCAGGCAGTTGACCACTCTCACGCTGTGCCTGCTGCTCGCCAGCTGCACGACCCACAAGGCTGAGCCGGCCAGGCCAGC 
CTTCGACAGCAGCCGCAATCCAGACCTGCTTTCTCCGGACCTGTATCCAAACGGTGTGCAGCCGGAGAAAGAGCCCGTAGTGC 
GCTATGGGCGCTACACCCTGGTCAGCACCCAGCCTGATGCCGGTCAACGCGACCTGATGGCCCAGATCATCGACGTi\ACCATC 
CCGTCGAGCATGAACCCGAGCGTCAAGGACGCCATGCAGTACGTGATGAGCCGCTCGGGTTACTCGCTGTGCCCGGCAGACGC 
CGGTCATGTGAACATCCTCTACACCCGGCCGCTGCCGGCAGCTCAGTACAAGCTCGGCCCGATGACCCTGCGCAACACCCTCC 
AGGTCCTCTCCGGCCCAGCCTGGCAGGTTAAGGTCGACGAGGTCGCGCGGCAGGTCTGCTTCGTGCTGCGCCCGGGCTATCAA 
CTTCCCCCGGCGCCGAGGCCGAAACCGGTCCAGCAACTGTATGCGAAGCCCGCTGCCCCAACTCCGCCGGCGGTAGCGCAACC 
CTCCTCCACGGAGAAAGTCAGCACGCTGGAGTCGCCCATCGTGGTCGCCTCGGTGCCGACACCGGCGCCGATCACAACCAGCC 
ACGCTCCGGCCAAGAAGCCTGAATCCACCACTGTGCTCCCCCCAGCCGCACCGGCCAAGGATGGCCACCCCTCTTCTCCTCCC 
GCGGCTTCGGCACCGACCAAGCCTGCGGCCTCCGCCGTGAAGTCCACGCCGCCCACTCCACCCACCGTGGCTTCCGCCCCACC 
GGTCAAGGTGCTCACGCCGCCGGAACCGAGCCGGCCGCTGGCACAGGCCTGGTCAGCCGAGACGGGATCAACCCTGCGCGACA 
CCTTGGAAGCTTGGGCAAAGCGCGCACGCTGGACCGTCCGCTGGGAGCCGCAGGATCTCAACTATCCGATCGAGGCTCCACTG 
ACCTTCCACGGCTCCTTCGAGGACGCGGTATCCGAGCTGTTCCCCCTGTATGACGCTGCCGAACGGCCCTTCCTGGTGAACGC 
CAGCCGCCCGCAGTCCCTGATCATCATCAAGGAGCGCAAGAACTGA 

Protein sequence: (SEQ ID NO: 187) 

l^RQLTTLTLCLLl^CTTHKAEPARPAFDSSRNPDLLSPDLYPNGVQPEKEPWRYGRYTLVSTQPDAGQRDLMAQIIDVTI 

PSSNn^PSVKDAMQYVMSRSGYSLCPADAGHVNILYTRPLPAAQYKLGPl^LRNTLQVLSGPAWQVKVDEVARQVCF^ 

LPPAPRPKPVQQLYAKPAAPTPPAVAQPSSTEKVSTLESPIWASVPTPAPITTSHAPAKKPESTTVLPPAAPAKDGHPSSPP 

AASAPTKPAASAVKSTPPTPPTVASAPPVKVLTPPEPSRPLAQAWSAETGSTLRDTLEAWAKRARWTVRWEPQDLNYPIEAPL 

TFHGSFEDAVSELFPLYDAAERPFLVNASRPQSLIIIKERKN. 

RL087 

DNA sequence: (SEQ ID NO: 66) 

TTGAGCTTTAAATACTATTGGGCTAAATTTTTCTGGGGAGCTTTCTTCTTTGTTTTAGTCGCTTGGAAAGGCTCCGTATTTCC 
AAGCCTGGCATCAGTTAACCCCTTGGTAGTGGCTGGATTCAGTACTATCCTGTTTCCTTTCTCGGTAAGGCTTGTTGAAGACT 
TCGCTTTAAAATATACGGAAAAAGAGTTCTGGGTCACAGGTTTTTTCTCCGAAACCCCTGCAAAAACAGGATTGTATGCAGTC 
TTTTATTTGGCTTGTTATTTGTTTTCAATTCCCTTGGGGATGATTTTTTTATTCTATAAATACGGAAAGGCCTCGTAG 

Protein sequence: (SEQ ID NO: 188) 

LSFKYYWAKFFWGAFFFVLVAWKGSVFPSLASVNPLVVAGFSTILFPFSVRLVEDFALKYTEKEFWVTGFFSETPAKTGL 
YAVFYLACYLFSIPLGMIFLFYKYGKAS . 

RL088 

DNA sequence: (SEQ ID NO: 67) 

ATGTCCAATGACAACGAAGTACCTGGTTCCATGGTTATTGTCGCACAAGGTCCAGACGATCAATACGCATACGAGGTTCCCCC 

TATCGATAGCGCGGCCGTTGCCGGGAATATGTTTGGCGACTTGATTCAAAGAGACATATATCTACAGAAAAACATTTATTATC 

CAGTCCGATCCATTGTTGAACAAGGAACAAAAGAAAAGAAGGAGATCAACAAGAAAGTATCTGATCAAGTCGATGGCTTGCTA 

AAGCAGATCACTCAAGGAAAAAGGGAGGCCACAAGGCAAGAGCGAGTCGATGTCATGTCGGCAGTCCTGCACAAGATGGAATC 

TGATCTTGAAGGATACAAAAAGACCTTTACCAAAGGCCCATTCATTGACTACGAAAAGCAGTCAAGCCTCTCCATCTATGAGG 

CCTGGGTCAAGATCTGGGAGAAGAACTCTTGGGAAGAAAGAAAGAAGTACCCTTTTCAGCAGCTTGTTAGAGATGAACTGGAG 

CGGGCGGTTGCCTACTACAAACAAGATTCACTCTCTGAAGCGGTAAAAGTGCTAAGACAGGAGCTCAACAAGCAAAAAGCGCT 

AAAGGAAAAAGAGGACCTCTCTCAACTGGAGCGGGACTACAAAACCAGAAAGGCGAATCTCGAGATGAAAGTACAATCCGAGC 

TTGATCAAGCGGGAAGTGCTTTGCCTCCATTGGTCAGTCCAACGCCAGAGCAATGGCTTGAACGTGCCACAAGACTGGTTACG 

CAAGCAATTGCTGATAAAAAGCAGCTGCTVGACCACAAACAATACTCTTATCAAGAATGCCCCAACCCCTCTAGAAAACKrAGA^ 

AGCCATCTACAATGGTGAGCTACTTGTGGATGA(3ATAGCCAGTCTACAGACCCGCTTAGATAAGCTGAACGCCGAAACGACAC 

GACGCAGGACAGAAGCAGAACGCAAGGCGGCCCSAGGAACAAGCGTTGCAAGATGCTGTTAAATTTACTGCCGACTTTTATAAG 

GAAGTAACTGAC3AAATTTCK5CGCACGAACATCAGAGATGGCGCACCAACTCK3CCGAAGGCCK:CAGGGGGAAAAAT^^ 

TTCGGCGGAAGCAATCAATTCGTTTGAAAAACTVCT^GGATGCGTTAAATAAAAAACTTAGCCTTAAAGATAGGCT^GCCATTG 

CCAAAGCCTTTGATTCTCTAGACAAGCAGAT<3ATGGCGAAGAGCCTTGAGAAATTTAGCAAAGGCTTTGGAGTTGTAGGCAAA 

GCTATTGACGCCC3CCAGCCTGTACCAAGAGTTCAAGATATCTACC3GAAACCGGGGACTGGAAACCATTCTTTGTAAAAGTTGA 

AACACTAGCTGCTGGTGCC3GCCGCCAGTTGGCTTGTGGGTATTGCATTTGCCACGGCAACGGCCACTCCTATAGGCATCCTGG 

GGTTCGCACTGGTAATGGCAGTTACCGGGGCGATGATTGACGAAGGCCTTCTAGAAAAAGCAAACAACCTTGTAATGTCCATT 

TAA 
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Protein sequence: (SEQ ID NOil89) 

MSNDNEVPGSMVIVAQGPDDQYAYEVPPIDSAAVAGlmFGDLIQRDIYIiQKNIYYPWSIVEQGTKEKKEI^^CKVSDQVDGLL 

KQITQGKREATRQERVDVMSAVLHKMESDLEGYKKTFTKGPFIDYEKQSSLSIYEAWKIWEKNSWEERKKYPFQQLVRDE 

RAVAYYKQDSLSEAVKVLRQELNKQKAIjKEKEDLSQLERDYKTRKANLEMKVQSELDQAGSALPPLVSPTPEQWLERATRLVT 

QAI ADKKQLQTTlSnSTTL I KNAPTPLEKQKAI YNGELLVDE I AS LQTRLDKLNAETTRRRTE AERKAAEEQALQD AVKFTAD 

EVTEKFGARTSEMAHQLAEGARGKNIRSSAEAINSFEKHKDALNKKLSLKDRQAIAKAFDSLDKQMMAKSLEKFSKGFG^ 

AIDAASLYQEFKISTETGDWKPFFVKVETLAAGAAMWLVGIAFATATATPIGILGFALVMAVTGAMIDEGLLEKANI^ 



RL089 

DNA sequence: SEQ ID NO: 68 

ATGAACCGTCCACGCCTGGTTAATCGTACCTCCGCGACACCTTCGACGCTTCTGCAGCGGGCTATCTTCGACGGCTACGACTT 

CGGCTTGAAGATCCCCTACATCGCAGGCAGCAATCGCGCGCTGCTGGAGCTGTCCGGCTTCTTCATCAGCGCCCGGGAGCATC 

CGTTGCACCGCTACTGGCGGGTCCCCAAAGGCAAGCTGCTGCCTGAACTGGACACTCTGTACAACCGTCTCGCCGAGCTAGCT 

GGAGGCCTTCACTCCCAGTCCTGGCGGGAGTTCAGCTCCTTGGTCGAATCCGCGCAGGCCTCGCTTGACCGACAGGCCTTCAC 

CTGGGGGATGCTGCTGCGCATCGCGCCCCTGGCCGAGGGCGGCGTCCTACTGTCAGGCGAGTTCCATCCTGGTGTTGTGGCGG 

TGGCTCGGCGGATGCGCGGGGTATTCCTGCGCCCATCGAGTTCCTGGCGCATCGACACCACTCCCGAGCTGCrCCGAAGC;^ 

CTGATTCTGGAGCTTGGCCTCGCCGAGGAACAATTCGAGATTCTGGATACTGTCCAGGAGCTGCTCAGCGACGGCAGCTTCGC 

GCCGTCGACCGAGCTGCCCAGCATGAGCATCGGCGGTCCACAGCAGGAACCGGCAGCGCCATCCCTGGAGGACGAGTCAGCCT 

CTGACATCTACCTCGCCGCGGTGCCGGAGATCGAGCGCACCGAGTACAGCTCGGCTGATATCGAGGCGGCGCTTCAGGGCTAC 

TCTCTACTGGCCCACCAGCCTGACGGCATCGCTCATCTGCTGCAGAGAACCAGCGCCTTATTGGCCGACGACATGGGATTGGG 

CAAGACCCGCCAGGCGGTCATCGCCGCTTCGATCCGCGCGGCGGGCAGACCAATCCTGGTCATCACCCTGGCTACCCTGCTGA 

TCAATTGGCAGCGGGAGATCCAGGAGGTCTATCCCTCGGCCACCGTGGCCATCCAGCAGGACACCCCAGAGGCGCAGTGGATC 

CTCGTCAACTACGAGCAGTTGAGCCCCTTCGTCGCCAACGCTTCGCGCTTCGCCGTGATGGTCATCGACGAGGCGCAGCGGAT 

GAAGGAACCGACGGCGCAATGCACGCGGCACGGTTTCGACATTGCCGCCCAAGTGCCGAACCGCTACCTGCTTACCGGCACGC 

CGGTGCTCAACCGCGAGACAGAGCTGCACACCCTGCTGCGCCTCTCAGGCCACCCCATCGGCCAACTGCCGCTGAAAGAGTTC 

TGCGACCGTTTCGCCGGCAACCCGGAGTTCCGCCAGAGTCTGCGGGCGGAGCTGGGTGACTGGATGCTGCGCAGGCGCAAAGA 

TGTGCTGCCCAGCCTCAAGGGCAAGCAGCGGCAGTTGCTGAAGGTGGCCCTCTCCACCGAGGAACGCCAGCAATACGACGTGC 

TGCGCCTCGAGGACCGACCGGTCTTCGCGCGACTCGGCGCGCTGCGGCGTTACCTGGAAACGGTGAAAGTTCGCGTGGCGATG 

GACCTGTTGAGCGAGCTCGACGCAGAGGACAAGGTGATCCTGTTCTGCGAGTTCAAGCCGACCGTGGCTGCGCTGAAGGAACT 

CTGCGAGCAGGCCGGACACGGCTGCGTCACGCTGGTGGGCAATGACTCGCTCACCAAGCGGCAGAAGGCGATAGATCGCTTCC 

AGCAGGATCCCGACTGCCGAGTGTTCATCTGCACTACGGCGGCCGCAGGGACGGGCAACAACCTCACTGCGGCGAACTACGTG 

TTTTTCCTCGGCCTGCCCTGGACTCCCGGTCAGCAGGAACAAGCCGAAGACCGCGCGTACCGAAACGGCCAGCTCCGCATGGT 

CGTGGTGAAAATCCCACTGGTCGAGGCCACGATCGACGAGCAACTGTGGCAACtGCTCAACGCGAAACGCCAGGTTGCCCAGG 

ACCTCATCGAGCCCGAGCAGGTCGACGGAAACCGCGCGCTTTTAGCCGCAAGCCTAACTGGATAA 

Protein sequence: (SEQ ID NO: 190) 

MNRPRLVNRTSATPSTLLQRAIFDGYDFGLKIPYIAGSNRALLELSGFFISAREHPLHRYWRVPKGKLLPELDTLYNRLT^LA 
GGLHSQSWREFSSLVESAQASLDRQAFTWGMLLRIAPLAEGGVLLSGEFHPGWAVARRMRGVFLRPSSSWRIDTTPELLRSN 
LILELGLAEEQFEILDTVQELLSDGSFAPSTELPSMSIGGPQQEPAAPSLEDESASDIYLAAVPEIERTEYSSADIEAALQGY 
SLLAHQPDGIAHLLQRTSALLADDMGLGKTRQAVIAASIRAAGRPILVITLATLLINWQREIQEVYPSATVAIQQDTPEAQWI 
LVNYEQLSPFVANASRFAVMVIDEAQRMKEPTAQCTRHGFDIAAQVPNRYLLTGTPVLNRETELHTLLRLSGHPIGQLPLKEF 
CDRFAGNPEFRQSLRAELGDWMLRRRKDVLPSLKGKQRQLLKVALSTEERQQYDVLRLEDRPVFARLGALRRYLETVKVRVAM 
DLLSELDAEDKVILFCEFKPTVAALKELCEQAGHGCVTLVGNDSLTKRQKAIDRFQQDPDCRVFICTTAAAGTGNNLTAANYV 
FFLGLPWTPGQQEQAEDRAYRNGQLRMVVVKIPLVEATIDEQLWQLLNAKRQVAQDLIEPEQVDGNRALLAASLTG. 
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RL090 

DNA sequence: (SEQ ID NO: 69) 

GTGGCACCTCTCGACAACGCCCCCCCTAGCGGGCCGCTACAGGATCCATCCCTGGCCCGCTACAGCGAGCGGCAGCTCGCCGT 
CGCCAACACCTGGGCAACACATTTCTCCCTCGCAGGGACAGCTCGAACCAAGTTCATCCGTCACTACCTGCGCAGCACATCTA 
CGACCAGGTGCTGGTGCATCACAGTCGCTGCTGACAACGGTGTGCGTTACACCATCATGCGTGCAGGGCCGCTACTCCAGGTA 
TTCGACGGTCAACTAATTGGTGCGTGGGAGTGCAAGCCTGCCCATCGTATCCCGGCAAGCACGCCGTCTCGAGCAGGGGCCTT 
GAAGCTGCTACAGCGCCTTCAAAAGTTCGACGACGCAGTTGCTGTACTCAGCTCATACACAAAGCGAGCGCACGACCTAGCCA 
CACAGATGGCCAGGGACGATCTCGGACTTCAACATCGCCTCGTGTATCCGAGCCACAGCAACAAGCGCTACTACGCGCCAAGG 
CACCAGTTCTACTTGAAGCAGATCGGAGCGGTCTTGCGAACCTTCAGACAGGTCCTGGACCAAGACCTGCTGTTCGCCATCCG 
CTCGGTTCGGTGCCTCTCGCCCCAGCTCTACAACTGGCTGGCTCAAGGCGACCAGGTGCGCCGGCTGCAAATGCTGAAGGCTC 
AGCCGGTCTTGACGCCGCTACTGGTGGATTGCGAGGAGGGAGTCTGGCCTCACACGACGACCAACGACAACGGCGAGAGCATC 
CGCCATTACCTTCCTTGCCCCTTTCCCCAGCTTGACAGTGAACGACCGCAGGCCGCCGCCATGCCATGCGACTTGTACCTCGA 
TATGGGCCGTATTCTTGGGCAGGTCGCGGACGAAGGAATTTCGGTCATCAACTTTTTCGCCTGGCTATTTCAGGCGCCGCGGG 
CCTCGATTCGATTTCTTAGTCACGTCAGTCCCGGCCGTGCGGGAGGAGCTCTCTTCCATCGCAAACGGGAAGGCCGACATTCG 
GGATGGCATGCTCTCCTACTGGCGGCATCGCTAGGTAACCGGCGGCCGATCACTCGCGCTCAATGGACAGCATTCTATGCCGC 
CTACAATGCGATCCCTTGGCAAGTTCACAACGCCAAGCCCGACTACAACCGTCTCTTCAACGGCTGCCCGTCGGATTGGCAGG 
ATCCGGCT^TGGCTTGCAATCACTGCACGGCTGAGAGACATCT^GGAGTTCTATACCGCCCTCGACCAGGGGAACTCACAGGTT 
GTTCGGCAGGCGCGCAGCGCCCTGAAAGCGTATCTGGGTCATTGTACCTACCGACAAGCTGGCAACCTGGTGGACGACTACCA 
CCAGGTCCAGAGGGAGCTGCGTGCCGCAGTGCAGAGCAGCCTGCCCGATCTGGTCGACACCGACGAGTACACCACCTGGGAGG 
GAATGCTGTCTGTCGGTCTTATCGATTGCCCTAATGGACTGCAGATCGTCGAGCTCCGCTGTCCTGCCGACCTATATGCCGAA 
CATATCGCTCTGGCACATTGCATCGATAGCTACGACCAGGCCGCCTACCGAGGAGACTGCCGACTGCTCTCAGTACGTGAGGC 
TGGTCGTCCGCTGGCCTCTGCCGAATTGGAGCTCAGGCGTGAGCATGGCGAGCCTATAGGTAGGCCCTGGAGTCCCAAGCACC 
TTTCCACGGTGCAACTGCGCGAATTCGATAATGCCCCCGTGCCGACCGACTCGCCTGCCGGCCAGGCATACCGCTGGTTCATG 
GAACGAATTCGCTCTGGAGCCATAGCGACGAACCTGAACTGGCCCGACATGACCGTCCACATGACGCGCTTCGCCAATGGTCG 
CTGGAAGGCGGGCCTCGCCGAAGCCACGGCGAAGTGGCTGCTCACTCAGTTGGAAGACCGATGA 

Protein sequence: (SEQ ID NO: 191) 

VAPLDNAP PS GPLQDP S LAR YS E RQIJ^VANTWATH FS LAGTARTKF I RH YLRSTS TTRCWC I TVAADNGVR YT I MRAGPLLQV 

FDGQLIGAWECKPAHRIPASTPSRAGALKLLQRLQKFDDAVAVLSSYTKRAHDLATQMARDDLGLQHRLVYPSHSNKRYYAPR 

HQFYLKQIGAVLRTFRQVLDQDLLFAIRSVRCLSPQLYNWLAQGDQVRRLQMLKAQPVLTPLLVDCEEGVWPHTTTNDNGESI 

RHYLPCPFPQLDSERPQAAAMPCDLYLDMGRILGQVADEGISVINFFAWLFQAPRASIRFLSHVSPGRAGGALFHRKREGRHS 

GWHALLLAMLGNRRPITRAQWTAFYAAYNAIPWQVHNAKPDYNRLFNGCPSDWQDPAWLAITARLRDIKEFYTAIJDQGN 

VRQARSALKAYLGHCTYRQAGNLVDDYHQVQRELRAAVQSSLPDLVDTDEYTTWEGMLSVGLIDCPNGLQIVELRCPADLYAE 

HIALAHCipSYDQAAYRGDCRLLSVREAGRPIiASAELELRREHGEPIGRPWSPKHLSTVQLREFDNAPVPTDSPAGQAYRWFM 

ERIRSGAIATNLNWPDMTVHMTRFANGRWKAGLAEATAKWLLTQLEDR. 

RL091 

DNA sequence: (SEQ ID NO: 70) 

ATGCGAAAAGAGAATATATCTGCCG7UVATCACAGAGCGAGCTTTTGATTTTTTCTATTGGTTCTCGCGATTTGAGTTCAGCCT 
CAAAGAGAATGGCTACTTAAAAAATTACAAACCTGGAGCTAGGGCAGAGCCGGGATGGGAAAATTTTGTACAAAACCATTCTG 
ACAAATACTCTCTTTCCCAATCAGCCACAGCACTAATCGAGCAGAGTCCAGAGCAACAAATAGTCCTGCCCGGTAGAGAGCTG 
GGTTGGCGTCCGGTTAAATTAGATGAGGACAAAAGCGACTTAGCTAGAGTCGCTCGCTTACTTAAGACCGTGCGAAACAATCT 
ATTTCACGGAGGCAAGCATGGTGGTGCCAACTGGGACAACCCAGCGAGGACAATACATCTTATTCTTTTAAGTAAAGCTATCC 
TTGACGAGTTTGCTGCACTAGGAGACTTTGAGGCTGACTACAAGAGAATTTACTGA 

Protein sequence: (SEQ ID NO: 192) 

MRKENISAEITERAFDFFYWFSRFEFSLKENGYLKNYKPGARAEPGWENFVQNHSDKYSLSQSATALIEQSPEQQIVLPGREL 
GWRPVKLDEDKSDLARVARLLKTVRNNLFHGGKHGGANWDNPARTIHLILLSKAI LDEFAALGDFEADYKRI Y . 
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RI.092 

DNA sequence: (SEQ ID NO: 71) 

ATGCACATCGTAATCATTGAAGCCCCGGGCAAGCTGAAAAAGCTGAGGTCCCTTCTCCCCTCGATTCGTCCCGACGTGACCTG 
GCAGGTCGAGGCGACAGCCGGCCACATCAGAGACCTACCCGTTCACGGGCAGGATCCGCAGATGCTCACCGTCGGCGTGGGCC 
AGGATTTCAAACCGCACTACCAGATCCTCTCGGGCAAGGAAAAAACCGTCGCACGGCTGAAGGAGCTGCGGCAGAAAGCCGTG 
GAAATCTACGTCGCATCGGACCCGGATCGCGAAGGCGAAAGCATTGGCTGGCACATCCTCCAAGCTGCCGGGATCAAGAACTA 
CAAGCGCGTTGCCTTCAAAGAAATCACAAAGTCATGCATCACCGCCGAACTCAGCTCGCCGCGTCGCCTGGACCTCCCGAAGG 
TCGCCTCGCAGGAATGCCGTCGCGTCATCGATCGCCTGGTGGGGTATCTGGTCACGCCAGAGTTGCGGCGCGTGATGGGTAGG 
CCGACCACCGCCGGGCGCGTGCAGTCCGTCGCGGTGTACCTGGTGGTCCTGCGAGAGCGGGAGATCCGCGCCTTCACAGCAAT 
CAAGCACTTCGGGGTGGAACTGACCTTCGTTTCGCCCAGCGACGGCCGTACCTGGACGGCGGAATGGGATCCAGTGCCCGTGT 
TTGCCAGCGAGGAGTTCCCGTATGTCCAGGATCGTCAACTCGCAGAACTGGTGGGGGCTATACGTAATGTCATCGTCGAGACC 
TGCATTGATAGCGAAGAAACCGATGCGCCTCCGGCACCGTTCATCTCCTCCTCGCTCCAGATGGCCGCCGGGAATGCGCTGAA 
GTGGTCACCCGACTUVGACGATGAAGGTCGCCCAGCGGCTGTATGAACAGGGGCTCATCACCTACCACCGGACGGACAACCCCA 
ATATCTCGAAGGACTCGATGCCGGATATCCGTGCTGTCGCCAAAGCCTTGGGGCTGAAGTGTGTTGAGCAACAGCGGATGTTC 
AAAGCGGACCAAGACGCCCAGGAAGGCCACCCCGCCATCACCCCTACCGACTGGATGGCCGCTGCCGCCGGTGAAACTGCTGA 
TGAGCAGGCGCTGTACCAGCTCATTCGAGTCCGCGCGCTTGCCAGTCAGATCGAAGCTGCCGTGTACGCAGTGAGAACCATCA 
CCCTCCTGGGCGTCGGCCCCGACAAAAAGCCGCTGCGCTTCGGCGCCAAAGGGAAGCTGTTGAACGTGCCTGGCTGGAGAAAA 
CTGCTGCAGGGTGATGACGCCGAGGAGCAGAAGAACGAAACGCCTTCAAACCCCATCCCGATCCCGGCGCTGGAGCCACGCCA 
GATACTCAAGGTCTACAGCGGCGAGGTCCTGGAGAAGAAAACCACCCCTCCCAAGCGATTCACCGACGCCAGCCTGGTGGGCG 
AGATGAAGCGCCGCGGGATTGGTCGGCCATCCTCCTACGCCTCGATCGTGAAGAACATCATCGACAAGGGCCAGGTGCAGATG 
AAGGGGCG7UVGCCTGATCCCCGGCGAGCTGGGAGAGGCCACCATCGCGCTCCTGGAGCACAACTTCAGCTTCCTCAGCCTCGA 
CTTCACCCGCAACCTCGAGGTCGCCTTGGACCGGATCGCCAACAGCGAGGACACCTACATGAACGTGGTCCAGCAGTTCTACC 
AGCTACTACAGACAGAGCTGCAGACACTCCGCGCGCTCCCCAGCGCACAGGACGAACCACGCGCAAGCTCCACCGCCAGTATC 
TCCTCGGCGCCGACCAGCGACTTCCTTTGCGGCAAGTGCGGTCTGCCCCTGGTTCACCGCAAGAAAGCCGGCAAAGGCGGCTT 
CGACTTCTGGGGTTGCAGCGGCTATCGAACAACAGGGTGCAAGGTTAGCTACCCCACCAAGAGCGGCCGGCCTGACTTCGACA 
ACCCGCGCGGGCTATAG 

Protein sequence: (SEQ ID NO: 193) 

MHIVIIEAPGKIiKKLRSLLPSIRPDVTWQVEATAGHIRDLPVHGQDPQMLTVGVGQDFKPHYQILSGKEKTVARLKELRQ^^ 
EIYVASDPDREGESIGWHILQAAGIKNYKRVAFKEITKSCITAELSSPRRLDLPKVASQECRRVIDRLVGYLVTPELRRVMGR 
PTTAGRVQSVAVYLWLREREIRAFTAIKHFGVELTFVSPSDGRTWTAEWDPVPVFASEEFPYVQDRQLAELVGAIRNVIVET 
CIDSEETDAPPAPFISSSLQMAAGNALKWSPDKTMKVAQRLYEQGLITYHRTDNPNISKDSMPDIRAVAKALGLKCVEQQRMF 
KADQDAQEGHPAITPTDWMAAAAGETADEQALYQLIRVRALASQIEAAVYAVRTITLLGVGPDKKPLRFGAKGKLLNVPGWRK 
LLQGDDAEEQKNETPSNPIPIPALEPRQILKVYSGEVLEKKTTPPKRFTDASLVGEMKRRGIGRPSSYASIVKNIIDKGQVQM 
KGRSLIPGELGEATIALLEHNFSFLSLDFTRNLEVALDRIANSEDTYMNWQQFYQLLQTELQTLRALPSAQDEPRASSTASI 
SSAPTSDFLCGKCGLPLVHRKKAGKGGFDFWGCSGYRTTGCKVSYPTKSGRPDFDNPRGL. 

RL093 

DNA sequence: (SEQ ID NO: 72) 

ATGGATCAAAGCCTTTGCACATGCATGCCAACGCCAATCGTCAACCCCAAGGAGCTGCGACTGTGCCACATGTTAGTCGGTAG 
AACTTTCCCGATAACATTGATCGCAGGCGACCATTGGTTGAGCTATGACGGCAGCGCCTGGTGGGTCGATGCGGATGAGCCCG 
CGACGGAGGACGAGGTGGCGGCTCTGTTGGTCAAGGCTGGTGGTGTCACTACGTGCTGGTGCGGATAG 

Protein sequence: (SEQ ID NO: 194) 

MDQSLCTCMPTPIWPKELRLCHMLVGRTFPITLIAGDHWLSYDGSAWWVDADEPATEDEVAALIiVKAGGVTTCWCG. 
RL094 

DNA sequence: (SEQ ID NO: 73) 

gtggcaagggcttccgaatcggaaatctcgaccagtacgaggtgcagtgtgtcaaagagagcgaccgataccgacaagctgga 
c:agacgacacttcaacgatccccaccggactgtacgggctattggtgctgaggccgcgcggaaagggctacgggtgttcgact 
gcccctacagtcatcctgcgatgcgggcgtcctggttgaaagggtttgcccaggagcagcaacagcagctcgacttctga 

Protein sequence: (SEQ ID NO: 195) 

VARASESEISTSTRCSVSKRATDTDKLDRRHFNDPHRTVRAIGAEAARKGLRVFDCPYSHPAMRASWLKGFAQEQQQQLDF 
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RL095 

DNA sec[uence: (SEQ ID NO: 74) 

ATGGCTACCCCCGTCTTCTGGGAAGCCAACATTGGCTCGGCGCCGGAGCACCGCAGCTTCCCCAACGGC7VACAATCCCCCGCG 
GCAGTTGCTGCGACTGAACGTGATGTTCGACAACTCGATTCCCGATGGCCAAGGTGGCTACAAGGATCGCGGCGGCTTCTGGT 
GCAGCGTCGAATGGTGGCATCAGGATGCCCAGCGCTTCGCCGAACTGTTCACGAAAGGTATGCGCGTCAAGGTCGAAGGCAGG 
GCCATTATGGACCGCTGGCCGGACAAAGAGTCAGGCGAAGAAGTCCAGGCGCTGAAGGTCGAAGCCTCGCGCATTTCCATCCT 
TCCGCATCGCCTGGCCGAGGTCACCCTGTTGCCAACCCAGCATCAACAGTCTCGGAACGTCCCGCAGCAACCTGCTCAGCAAG 
ATGCGCAATCGCAGCAGGACTACGACAGCGCCTTCGACGACGACATCCCCATGTA 

Protein sequence: (SEQ ID NO: 196) 

MATPVFWEANIGSAPEHRSFPNGNNPPRQLLRLNVMFDNSIPDGQGGYKDRGGFWCSVEWWHQDAQRFAELFTKGMRVKVEGR 
AIMDRWPDKESGEEVQALKVEASRISILPHRLAEVTLLPTQHQQSRNVPQQPAQQDAQSQQDYDSAFDDDIPM 

RL096 

DNA sequence (SEQ ID NO: 75) 

ATGCGGCAGCTCGATAAGGACCAGCAAGGCGCTCTCKSAACAAAGTGCCTTCCGCCCACTGCAAC^^ 

GCAACACAGTGCCTCACTAAAAGGCCTTTTAAAGCCTTTTAAAGGTAATAGGGAGCTGGCCCAGTTGGCGGAACAGTGCGAAG 

CCAT(X5AGC7VGGGATTGCTTGAACTTGCCC7VGGC3ACTGCTGGCC(:yVCK3TTCGTCGCCCACCCTTCACTCTACT 

CTCATCGAGCAGCGCACATCCGCCCGCACAACCTTTCTCCGCTGGCAGCACATTGCATCCCGGCGGATGGGCGTCGGGGTGTG 

GACGGAAATGCTGCGCCAGGACAAGACCCCGGAATACCTGCTGCAAGACCTCTACGAGATGGAGCTGCAGCGCATCACCCTCA 

ACATGCAGATCAGCCTGATCCACTCCATCGGCAAGCAGGCCGCCGAGTGCGCGGAAAAGATGGGCCAGGCCGAGGCCGAGTTC 

ATGGGCCGACTGCAGCAGAGCACCAACCACCACTGA 

Protein sequence: (SEQ ID NO: 197) 

MRQLDKDQQGALEQSAFRPLQQTAFQALQHSASLKGLLKPFKGNRELAQLAEQCEAMEQGLLELAQGLLAQVRRPPFTLLPTR 
LIEQRTSARTTFLRWQHIASRRMGVGVWTEMLRQDKTPEYLLQDLYEMELQRITLNMQISLIHSIGKQAAECAEKMGQAEAEF 
MGRLQQSTNHH 

RL097 

DNA sequence: (SEQ ID NO: 76) 

ATGGCTGAAACCCATCGGCrrTCAGATCGGCTCTCTCCGCAGCGATGTCGCTCTGACGCTTCACACCTATCACGCCGCCCGCAT 

CTGGACCGGCCGGt^GAAGTCGGATGCCAAGCACAGCATCCTCGGCCTCTCCGGCTTCTGCGCATACGTCSAATCG 

GCGGGGCAGCACAGGACGATCCGTACTCCGACTGGTGGCTGGTTCAGATCGAAGAGAAAGTCGAGAGCTGCCAAGCCGCACTC 

GAGGCCATCGACCAGCGACTGGATGACGTCATGGCCAAGCTGCCCGCGACCCTCC3ATATCTCCGAGAACCTGTCCGTTACACC 

GGTCAAGGTCCCGTTGTTCATCTCCAACCCTCTCGGCTTCAAGGCAGTCTATCTCCTGACCAACTACGACGAACTCGCCCGTC 

GAATCCTGCTGGCCCAGCACGTCGGGCTGGTCGGTCGCCGCGACATGGAGGTCTGGCTCGACGAAGGTGCGTCGGTGCTGCGA 

AGCCTGTTTGGTCTGGCCCAGAGCTACCAGTTCTCGGGCGCCACTCGCGACGACTTCGCCGCAAACAATGCTCGCGCCGAAGC 

CGCGCGGAAGATGTACGAGAAGTTCGGCGAGATCCCGCAGGACATCCTGGAGGGCACTCGACGCTCGAACTTCGCTCCGCCGA 

TCACCCGGGGCCGCTCTGACGGTGATGCCGATGATGACGCTGACCGTGTCGAACTCGAGGACTGA 

Protein sequence: (SEQ ID NO: 198) 

MAETHRLQIGSLRSDVAIiTLHTYHAARIWTGRQKSDAKHSILGLSGFCAYWRMHRGAAQDDPYSDWWLVQIEEKVESCQAA^ 

EAIDQRLDDVMAKLPATLDISENLSWPVKVPLFISNPLGFKAVYLLTNYDELARRILLAQHVGLVGRRDME 

SLFGLAQSYQFSGATRDDFAANNARAEAARKMYEKFGEIPQDILEGTRRSNFAPPITRGRSDGDADDDADRVELED 

RL098 

DNA sequence: (SEQ ID NO: 77) 

ATGTTCTTGAGCATGGCCCCTTTCTTTTTGGTCGTTCTCGTTCTTTCTGCACTTTTTACAGATGCGTGGAACGACCGAGAACT 
CAGGCTGTTGTTAATGCTGATCGTGTTCGGGTATTCAGTAACCGTGTTGACCATTACCSGTTGAGATGTATCGCTTTGAAATGG 
CGGAAAAAGCGATGTGGGGAGCTTTATGCAACAAAGCCAACTACATGAACTGCCAACCAGATTACCAACGGTAG 

Protein sequence: (SEQ ID NO: 199) 

MFLSMAPFFLWLVLSALFTDAWNDRELRLLLMLIVFGYSVTVLTIWEMYRFEMAEKAMWGALCNKANYm 
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RL099 

DNA sequence: (SEQ ID NOt 78) 

ATGAGAAAGTCTCGGTCGGGCGTCGTGTTTTTTGGTGATGCGGCCCGCATCACTCTCCCAGGTCCTGACCTCCGCGCCGCCGG 
CGAGCTGGGTGATTCCACTGGAATCACTCCACCAGGAGCCGACCTCCGCGCCGCCGGCGAGCTGGGTGATTCCACTGGAATCA 
CTCTGCCAGGGATCCACTTCGGTATCGGCGGCAAGATGGGTGTTTCGGGCCGAAACACTTCGCCAAAGCGAGGCATCACCACT 
CACGAGGAACTCAAACAATGTTCTTGA 

Protein sequence: (SEQ ID NO: 200) 

MRKSRSGWFFGDAARITLPGPDLRAAGELGDSTGITPPGADLRAAGELGDSTGITLPGIHFGIGGKMGVSGRNTSPKRGITT 
HEELKQCS . 

RLIOO 

DNA sequence (SEQ ID NO: 79) 

ATGAGGCTGTCGCGCTTTCCCATTTCGACACTGCTGGACTCGGCCTCGGGGCATCTCGAGGCCCATTTGTATAAGAAGCGGCT 
TGCTGCCGAAAGCGGCGAACCGCTGGCTCAACAATATTCCGGCATCATTTTCAGCGGCAATCCTCATGAAACCGTTCCACGGC 
GCCTCCTCCTC3GATAAGCGTCTTACTCCGCTC3GAGCGGAACTGCTGGCAAGTTTTCCGCTTGCTCATCAACGACGACGGACTG 
ACCGCGTTCCCGACATACGAGCAACTGCGCCCCTATCTCC3GTATGCAGCCGGGCAAGATC(3CCTCGCGCGAAACCATCGCCAA 
GGCACTCACGGTCCTTCGTCTGACCCGCTGGCTCAGCCTCGGCCGACGCCTGCGCAACC3ACCTCAACGGACAGGTCCAGGGCA 
ACGTTTACATCCTTCACGACGAGCCTGTCTCTCCAGCCGAAGCCTTGGAGCTGGACACCGACTACATGCAGTTGCTGAGCCAA 
TCCACCGGTCACGGCAACCGAGCCATACGCGAAATCGGGCAGATC:ATCTGGCC3GGAGTTCAGGGATGATCCGGACGTGGGTCG 

ccgcctccctacccatctggagaagctcgagggacgcttgaaccaccagcaatgggctatcgatagtcagctcgaagcggatc 
cagcggcagagttcggcatccgaactctgtcggatttacctcattccaccccgagttcggatgccgaactcagtgaaatcagc 
ggcaagcaatgcgctctaccgctgagttcggataccgaaccccgacagaatccgccgagtacgcccttggttcggatgccgaa 
ctcatatagtacgtatacatacaaacaagattctgtatgtaaaaagccagtacaaccgcgagcacgcgaggaagcccatccga 
actggcaggatctcctgcacgcactggaggccgagcaacggatccaggcagtaagcgcgctcagacgggtgtccgaggatctt 
cggctacccatcatcgagcagtggcagcaccgttgtgccggcggaacagtcagcaatccgttcggctacctcatgacgctcat 
ccagcgtgcagtccagggcaagttcaacgcttcttgggctccggaagaagcggctgagcgaaccatccccgcaacggaacgcc 
ccattcgtgctccggcaccatcaagccccatagcgcctacacagcctcaggtccagccccggggggatacccggacagggagc 
gaggtcctcagccggctcaaggacctcattcggcccaggcacggatcgagcgtgccatccgagcggggtgatgattcatga 

Protein sequence: (SEQ ID NO: 201) 

MRLSRFPISTLLDSASGHLEAHLYKraLAAESGEPLAQQYSGIXFSGNPHETVPRRLLLDKRLTPLERNCWQVFRLLI^ 

TAFPTYEQLRPYIiGMQPGKIASRETIAKALTVLRLTRWLSLGRRLRNDLNGQVQGNVYILHDEPVSPAEALELDTDYMQLLS 

STGHGNRAIREIGQIIWREFRDDPDVGRRLPTHLEKLEGRLNHQQWAIDSQLEADPAAEFGIRTLSDLPHSTPSSD7VELSEIS 

GKQCALPLSSDTEPRQNPPSTPLVRMPNSYSTYTYKQDSVCKKPVQPRAREEAHPNWQDLLHALEAEQRIQAVSALRRVSEDL 

RLPIIEQWQHRCAGGTVSNPFGYLMTLIQRAVQGKFNASWAPEEPAERTIPATERPIRAPAPSSPIAPTQPQVQPRGDTRTGS 

EVLSRLKDLIRPRHGSSVPSERGDDS . 

RLlOl 

DNA sequence: (SEQ ID NO: 80) 

ATGTCGAAGTCGACGATCAATGAAGCGGTCCTGACGCAGGTGCTCAACCACCTGCGCAACGGCCAGCTCAGGCGTTGTGCCGA 
GATGGGGCTGCGGCCGGAGATTCTGGCTCAGCTCCAACAGCCTGCCGTCATGAGCATCCTGACCAATACCCCGGTTTCCTGGG 
TAGATGTCAGAGTGAACATCGACGTCATGGAGAAAATCCTGGCCACAGCCGAGCGCAGCGCGCAGGAAGACCTGCAGATCGAA 
CGCGCACTGAAGCTGGGAGCCACCACAACGATGATCCAGAGCTTTTTCGGTCTGTCGCCGGAGGACACCGCCACCAAGCGCTT 
GATGCTGGAGATCCACCCGCGCCGCGGTCGCTGGCGGCAGCTCGATGAACAGATCGAGCGCCAGATATGGTTCCGCTGGGAGC 
ACCTGATGCAGGAAAATCAGGTCCGCCTTGAAGACAGCATGGAGTTGCTGGACATCGCGATGATCCTCACAGAGGAAATCAAC 
GCCGGAATCGAACAAGACAGTCCAGAATTCATCAGCCTCGCCATTGTTTGGTCTCTCATCCAGAGCTGGTTGAAAGACGGGCT 
CTATCCGTCTGGCAAATCGAGCCAGAGCCAGGCGGGCCTGCAAAAGTCCCAATCCACTCTTTACCTCGCTAGCGTCAGCTCAC 
ACCTGCCCCACTCTGCCCCATCCGCAACAACGCAGGTGAACGCTGAGACAGAACGTCAACAACTACTGAACCTGGTTCAGTCG 
GAAGGCGACACAGCACCy^TGA 

Protein sequence: (SEQ ID NO: 202) 

MS KSTI NEAVLTQVLNHLRNGQLRRCAEMGLRPE I LAQLQQPAVMS I LTNTPVS WVDVRVNI DVMEKI LA 
RALKLGATTTMIQSFFGLSPEDTATKRl^LEIHPRRGRWRQLDEQIERQIWFRWEHLMQENQVRLEDSMELLDIAMILTEEIN 
AGIEQDSPEFISLAIVWSLIQSWLKDGLYPSGKSSQSQAGLQKSQSTLYLASVSSHLPHSAPSATTQVNAETERQQLLNLVQS 
EGDTT^ . 
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RI.102 

DNA sequences (SEQ ID NOs 81) 

ATGAGTATGGCCAAGATCAACCCGCAAGATCTGAAAGACCGGCTACTTGCCCCTGGTTTTACCGCACCGCCCAAGGTTCTGGA 
GCAGTTGTCGGACCCGATCAGCGACACGCCCATGAGGCTGACACTCCACGACGTCCTCCCCTGGCACGATAACCCCCGGACCA 
CTCGAAACCCGAAATACGATGAGCTGAAAGAATCGATCCGACATCGAGGCCTCGATACGCCGCCACCAGTGACTCGACGCCCT 
GGAGAGGACAAATACCGCATTCGCAACGGCGGAAACACTCGCCTGGAAATTCTCAACGACCTCTACAAAGAGACCGGAGACGA 
GCGCTATTTCAGCTTCGACTGCCTGTTCAAGCCCTGGGACAAGCAGCGCGGCGAAATCATCGCGCTGACCGGTCATTTGGCCG 
AGAACGATCTGAAGGGCGACCTCAAGTTCATCGAGCGCGCGGTTGGGGTGCAGAAGGCGAAATTTCTTTACGAACAAGAGAAC 
GGCGGTGAAAGCATTTCCCAGCGCGAGTTGGCACGTCGGCTAAAAGCGGACGGCTACCCTGTATCTCAATCCCATATCAGTAA 
GATGTTAGACACTATTGAGGTATTGGCGCCGGCGATTCCTGTGATGCTGTATTCAGGGCTCGGTAAACCGCAAATCGAGAAAC 
TCCTGTCACTCAGAAAGTCGGCATCCTCCTGCTGGGCACGTCTATACGCTGGTGAAGGGGTTGACTTCGAAATGCTGTTCCAG 
GACACCCTGGCAATCTTCGACAGTAGCCCTGACGAATTCATTTTCGAGCGTTTCCAGGACGAACTCATCGACCAAATGAAGCG 
CCCCCTGGGCCTGCGTTATGACCAAATCCTGCTCGAGATTACCAACGGGCAGCAGGAGCAACGCCGCGGCACTCTGGTCGACC 
TGCCCACACCTGCCGCACCACCTCAACTCCCACCAATTGGGCAGGAAAACCCTGCTGCGTCGTCTACTGGACAAGCACAAACA 
CT^GAGCCCCGCCGCAGATCCCCTU^CGTCCAGGACAAGGAGCAACCCGGGTAATCCCCTCCCCCCGCCGGCTCCGCCACCACC 
TGTCCAACAAAAGCAATTGCCCGATGAGGAGCGTGCGGCGGTCTTGGCAGGCCATATCGTGAGCCCGGTATCGACTAAGATCC 
AGCAGACTCGCCAACGGCTGGCCGGCCTCGAGGGGGT^CATCTACCTGTCTTCGATGAAACAGCTCTGCAGGCAATCCCAGTG 
CAAGTCGGTGGCCTGCACCCGATCACCGATCTCTGGTACATCGAGCGGTCGATCGATACCCCCGAGATCCTGCGACAGCACAT 
CGCTGATCTGGCTGAAGAGATCGCTCTGCATGTCGGCGCCCCAGGCGAGATCGTCAGGATTCAGGGCGGTGTGGGTTACACGT 
ATCGCGAGCCCAATGAAGACCATGAGATTACTGATTCAGCGCTGCACCTCATGACGCTGCTTCAAGCGGTCAGCGGGCAGGTC 
CAAGTCGTTCTGAACACTCACGATCAACAGACCTGCCGCGATGCACTGGGTGAATTCCAGTTCTCAGCTGGCCTCGCTCAGTT 
GCTGCTGGGCCAACCCACCACAAGTGACAAGCCATCCTGCCAGGCAGGCCGTCTCAATGACGAAGCCCTGGTGAAACTGTTCC 
GGATCATTCGTCTTGCCCGACGCCTGGTTGACCTTGAGCTGCCGCCGGCCGCCTCCGAGCAAGCAGCTACTGACCAGTGA 

Protein sequence: (SEQ ID NO: 203) 

MSMAKINPQDLKDRLLAPGFTAPPKVLEQLSDPISDTPMRLTLHDVLPWHDNPRTTRNPKYDELKESIRHRGLDTPPPVTRRP 
GEDKYRIRNGGNTRLEILNDLYKETGDERYFSFDCLFKPWDKQRGEIIALTGHLAENDLKGDLKFIERAVGVQKAKFLYEQEN 
GGESISQRELARRLKADGYPVSQSHISKMLDTIEVIAPAIPVMLYSGLGKPQIEKLLSLRKSASSCWARLYAGEGVDFEMLF^ 
DTLAIFDSSPDEFIFERFQDELIDQMKRPLGLRYDQILLEITNGQQEQRRGTLVDLPTPAAPPQLPPIGQENPAASSTGQAQT 
QSPAADPQTSRTRSNPGNPLPPPAPPPPVQQKQLPDEERAAVLAGHIVSPVSTKIQQTRQRLAGLEGEHLPVFDETALQAIPV 
QVGGLHPITDLWYIERSIDTPEILRQHIADLAEEIALHVGAPGEIVRIQGGVGYTYREPNEDHEITDSALHLMTLLQAVSGQV 
QWLNTHDQQTCRDT^GEFQFSAGLAQLIiLGQPTTSDKPSCQAGRLNDEALVKIiFRIIRLARRLVDLELPPAASEQAATDQ. 

RIil03 

DNA sequence: (SEQ ID NO: 82) 

ATGACCATGGCCCGAGAAACCGAAGATAAGTTCGTTGTCCGTATGCCCTTGGGCTTGCGCGATCAGCTAAAGCAAAAAGCCGC 
GGATAACCACCGTTCGGCCAACIAGCGAGATCGTCTACCGACTGGAGCGCAGCAACGCGCTCGAAGAAGAACTCGCGCGAGCAA 
ACCGAATGGTCGACGAACTCTTCGCCAAGAACCAGCGCCTGCAGGCTGAGCTGGCGGCGGCGAACACGCCTCAGGTGGCGGAG 
GCATGA 

Protein secfuence: (SEQ ID NO: 204) 

MTMARETEDKFVWMPLGLRDQLKQKAADNHRSANSEIVYRLERSNALEEELARANRMVDELFAKNQRLQAELAAANTPQV^ 
A 

RIil04 

DNA sequence: (SEQ ID NO: 83) 

ATGCCTATCAAACACGCCATCGTCCACCTGATCGAGAAGAAGCCCGACGGCACCCCGGCCGTGCTCCACGCGCGCGACGCCGA 
GCTCXSGCGACTCCCAGGCCyVTCGAGAACCTGCTGGCCGACCTCyUVCGAAAGCTACAACGCCAAGAACAAGGCCTGGGGCTTCT 
TCCAGGGCGAGTCCGGGGCCTACCCGTTCAGCGGCT(3GCTCGGCGAGTACCTC3GAGGGCGACCGCGACTTCGTCGGCTTCAGC 
CGCGAAGCGGTCGAGCACCrrGCy^AAAGCTGATGCSACSGAGTCCAATCTCTTCACCGGCCSGCCACGTCCTGTTCGCCCACTACCA 
GCAAGGCATGACCGACTACCTGGCGATCGCCCrGCTG(::7VCCACAGCGAAGGCGTGGC(3GTGAACGAGTCGCTGGAGGTCACCC 
CGTCGCGCCACCTGGACCTCGGCCAGTTGCrACCTGGCCGCGCGGATCAACATTTCCGAATGGCGCAACAACAAGCAGTCGAAG 
CAGTACATCTCGTTCATCAAGGGCAAGGGCC3GGAGGAAGGTCTCCGACTATTTCCGCGACTTCATCGGCTGCCAGGAAGGGGT 
GGATTCGCCGAGCGAGACGCGCACCCTGCTGAAAGCCTTCAGCGATTTCGTGGAAAGCGAGGACATGGCCGAGGAACAGGCCC 
GCGAGAAGACCGAGACGCTGGTCGACTACGCCACCTCGCAGGCGCGCATCGGCGAGCCGATGACCCTCGACGCGCTTTCGGAA 
CTGATGGACGACCAGCAACCGCGGGCGTTCTACGACTACATCCGTAACAAGGACTACGGCCTGTCGCCGGAAATCCCGGCGGA 
CAAGCGCACCCTCAACCAGTTCCGCCGCTTCACCGGCCGCGCCGAAGGCCTGTCGATCAGCTTCGAGGCGCACCTGCTGGGCT 
CCAGGATCGAGTACGACGAGGAGCGCGACACGCTGCAGATCAGCAGCCTCCCCACTCAACTCCGCGACCAGCTCAAGCGGCGC 
AAGGCCCAAATTGGAGAATGA 
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Protein sequence: (SEQ ZD NO: 205} 

MPI KHAI VHL I EKKPDGTPAVLHARDAELGDSQAI ENLLADLNES YNAKNKAWGFFQGESGAYPFSGWLGEYLEGDRDFVGFS 
REAVEHLQKLMEES^rLFTGGHVLFAHYQQGMTDYLAIALLHHSEGVAVNESLEVTPSRHLDLGQLHLAARINIS 
QYISFIKGKGGRKVSDYFRDFIGCQEGVDSPSETRTLLKAFSDFVESEDMAEEQAREKTETLVDYATSQARIGEPMTLDALSE 
LMDDQQPRAF YD YI RNKDYGLS PE I PADKRTLNQFRRFTGRAEGLS I SFEAHLLGSRI EYDEERDTLQI SSLPTQLRDQLKRR 
KAQIGE. 

RL105 

DNA sequence: (SEQ ID NO: 84} 

ATGCGTAGTTTCCTTCGCGGCGCCCGGGAAAGCGTTCGCCGGCTGGTGGCCTTCGCTCAAGCAGAAGGCTGGAGCGTCGACCG 
CTCCGCAGGCGGCCACTTGAAGCTCAGCAAGATCGGCTGCGCCTCGATCTTCATTTCTTCCACGCCAAGCGACGCACGCGGCG 
AGCTCAATGCCCGCGCCCTGCTCCGTCGAGCCGACAGGCAGCGTTCCCTGAACCAGGAGTCTTTCTGA 

Protein sequence: (SEQ ID NO: 206} 

MRS FLRGARES VRRLVAFAQAEGWS VDRS AGGHLKLSKI GCAS I F I S STPSDARGELNARALLRRADRQRS LNQESF 
RL106 

DNA Sequence: (SEQ ID NO: 85} 

ATGCCTGACGTCACAGCCTACCGGCCGCTCGAGCACTTCCAGAAAGTCGAGCTGATGCTTGAGCTCAAGTTGCGTGAAGGTCC 
TTCGTGGATCTGTCTGAACTGCGGCTATCACCTGGATGGCAGCGGCGCACAGCCCTGCCCTGACTGCGGAAAGTCGCGCTACT 
GGACCAGCGGTTGGAGTGTAGGTCGTGGCCATCGCTTCTCGGCAGCAAGGGAAGAGTGGGAAAACCGCCTCAGGACACGGTCG 
CGGTCACCTGTCGCGTCAACGGCACCAGTAGCAACTGACGACGTATGCACTCAACTGCGCACAGAGGTCCGCATGCTGCGTTC 
CGCGCATGACGACCTGGCCTGCAGCCGGCAGAGCGATCGTCGCAGCCTTCAGGCGCTGGTGAAACGTCTCCTGGATGCCGCCG 
CCACCGATAGCCTTCCCCGCTCCCTTGCAGAGATGGAGACCTGGCTGCAGCTCAACAGCGAC3GAGACCACGAATGCGTAG 

Protein sequence: (SEQ ID NO: 207) 

MPDVTAYRPLEHFQKVELMLELKLREGPSWICLNCGYHLDGSGAQPCPDCGKSRYWTSGWSVGRGHRFSAAREEWENRLiRTRS 
RSPVASTAPVATDDVCTQLRTEVRMLRSAHDDLACSRQSDRRSLQALVKRLLDAAATDSLPRSLAEMETWLQLNSEETTNA. 

RL107 

DNA sequence: (SEQ ID NO: 86} 

ATGAAGGCGTCCCAGACCTATCAGTGCATCGTCAAGTTCGATGGCGCCGGTTTCTC3GACCAATACCATTCAGAAGCAGCGTGC 

gacctgcacctggagcgacaaggtggcagcctcccgccttgccgaacgactgtttggcgaggacaacgcatacatcacccgta 
tgccggtacaggcaggcgaccacgaaaagcgc:atcgagagccgctgggcgctgtcctgtagaaatcccaagga<3gtagcgcgc 
gatgcctga 

Protein sequence (SEQ ID NO: 208) 

MKASQTYQCIVKFDGAGFWTNTIQKQRATCrrWSDKVAASRLAERLFGEDNAYITRMPVQAGDHEKRIESRWALSCRNPK^ 

DA 

RL108 

DNA sequence: (SEQ ID NO: 87} 

atgaacactgaagcccgctttccgagtatccacgcctcggccgcgttcaccgactcggcagtggttcatgccaatcacgttgg 
ggtcaaccccatcgagctggacgccctcagccaagtgatctcgcgcctttcgcgggacgagagcacggtcgcaccc:agttcga 

TC3GAGCGAGAGCTTCGTGAGCTGGAGGAACTGGGGTACATCGAAATCTCGACCACCCAC3GCCG(3GACTCTGGTGGTCACTACG 
CGCGCTCCGGGGCAATTGCn'TTCCX3CTTACTTCTGGTCC3GTATGC3ATCCCGCGACACCTGTTCAGCTGCTCGCTGAAAGTGAG 
CCTGGTGCCGCACCTCTGCTGCGGCACTCAGGACTCCCAGCACCTCACCGCCGTGTTCCGCATTGCAGGCAGCAAC3GACGCCG 
CGCGCGAGTTCCTGCATCAGTTGGCCAACAACTATCCCGGGCATGAGCCC3GAGTTGCCCGAACTGGTGGCCGTTCAGGTCGGT 
GATGCACTCAGCAAGGAGGCCGAGTCATGA 

Protein sequence: (SEQ ID NO: 2 09) 

l^TEARFPS IHASAAFTDSAVVHANHVGWPI ELDALSQVI SRLSRDESTVAPS SMERELRELEELGYI EI STTQAGTLVVTT 
RAPGQLLSAYFWSWIPRHLFSCSLKVSLVPHLCCGTQDSQHLTAVFRIAGSKDAAREFLHQIiANNYPGHEPELPELVAVQVG 
DALSKEAES . 
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RL109 

DNA secpience: (SEQ ID NO: 88) 

GTTTCTGCAGATCAGCCGGGAGGACTCGAAGTGAGTGTTCTGGAACTTACGCCGCCGCACTCCGTCGAGGCGGAGCAAGGGGT 

ACTCGGCGGCCTGATGCTGGACAACGCGGCATGGGACATTGTCGGCGATCAGTTGCAGAAGGAGGACTTCTTCCGGCATGAGC 

ATCGGCTGATCTTCACCGCCATCAGCGAGTTGGCCGCGAAGGATGCTCCGTTTGATGTCGTGACTGTGTCGGAAGCGATCGAA 

GACCTTCCAGAAGCTGGCGGGCTGGCCTACCTCGGCCAGCTCGCCGACAACACGCCCTCCGTGGCCAATATCGAGGCTTACGC 

GCAGATCGTTCGCGATCGGGCACACCTGCGGCAGCTGATGTCTCTCGGGCACCACTGCACCAGGACCGCCTCGAACCACCAGG 

CAAATCCCTCTGAGGTTCAGGAGGAGATTGAGCAGAAGCTGTTCGGCCCTTGGCCAGGACCACCACAACGCCGATTTCGTCGA 

TATCAACAAGAGTCTCACGAAGATCGTCGACACCATCGATTACCGCTTCAACAACAACGTGACGGTAACGGGGGTCCCGACTG 

GCCTGAAGGATCTCGACGCACTCACCGGCGGACTACAGAAGTCGGATCTCATCATCGTCGGTGCCCGCCCCGCGATGGGCAAA 

ACGTCGTTTGCCCTCAACCTGGTCGACACCGCGCTCCAGAGCGACCAACAGAAGTCTGTTCAGGTGTACAGCATGGAGATGCC 

GGCAGAGCAGTTGCTGTTCAGGCTTGCCGCCCTGTTCGGCCACCTGGACCTGGGCAAGCTGATGAAGGGCCAACTGCAAGAAG 

AGGATTGGCCCAGACTGTCTGGCGCGATCCAGCGCATAAACGACTATGGCAGCCGGCTGGTCATCAACGATCAGGGCAACCTC 

ACGCCGACAGAGCTGCGCGCCAAGGTTCGCCGGGCGGCCAGGAAGTACGGACACCCCGCGCTGATATTGGTCGACTACCTGCA 

ACTGATGAGTTGCCCAGGCCTGGAGAATCGAGCCACCGAGATCTCGGAAATCTCCCGCTCGCTGAAAGCGCTGGCCAAGGAGA 

TGGACTGTCCCGTCGTAGCTCTATCCCAGCTAAATCGCGGCCTAGAGAACCGGACGAACAAGCGACCGAACTGCGCGGACCTA 

CGAGAGAGCGGCGCAATCGAGCAGGACGCGGACGTGATCATGTTCGTGTACCGCGACGAGGTCTACCACCCCAACACCGAGGC 

CAAGGGCATCGCCGAAATCATCATCGGCAAGTATCGCAACGGTCCGATCGGCACCGTCCACACCGCCTTCATCGCC^ 

CCCGCTTTGCCGACCTGGCGCCGGGGACCTGGCAA 

RLllO 

DNA sequence: (SEQ ID NO: 89) 

ATGACTCGGTCTGCTCTCTCGACCATCGCCTACGAGGCCCTGGTGCGTGCCCGCCGCAAATTCAGCAACCGAGAGGAGCGCTG 
CATCCGCGAAACCTGGACCGCCGAACAGGAACTGGTGCTGCTGCGCCTGTATCCGGATATGCCGAACGAGGTCCTGGCAGCCA 
GGTTGAACAAAACGCTCCAGCAGATCTGCTCCAGAGCGTATCGGCTCGGGCTGAAAAAAAGCCCTGAGTTCTCCAAGAAGATC 
CGGCAGGACTGGGGCAGCGCAACTCGGTTCAAGAAGGGAAACACCCCATGGAACTGCGGCATGAAGGGGCTGCCCGCGCGAGG 
ACGCGCACCAGAAACGCAGTTCAAGAAGGGGCAAAAGCCCCACACATGGCTCCCAGTCGGCAGCACGCGGGTCAGCGCTGATG 
GCTACCTGCAACGAAAGATCTCGGATACCGGCTATCCCCCCCGGGACTGGAAGGGCATCCACATCCTGCTCTGGGAAGAACAC 
TTCGGCCCCATCCCAACCGGCCATTGCGTCTGCTTCAAGGACAACAACAAGCAGAACGTCGTCATCGACAACCTGGAGCTCAT 
CACCCGGGCCGAACGGATGCGCCGCAACTCCATCCATCGCTATCCACCTGAGCTGAAGAGCGCAATCCGCGTCATCAGCAAGC 
TCAAACGCACCATTCAGGAGGTCGAGCATGAAGAACAAGATTGA 

Protein sequence: (SEQ ID NO: 210) 

MTRSALSTIAYEALVRARRKFSNREERCIRETWTAEQELVLLRLYPDMPNEVIAARIiNKTLQQICSRAYRL^^ 

RQDWGSATRFKKGNTPWNCGMKGLPARGRAPETQFKKGQKPHTWLPVGSTRVSADGYLQRKISDTGYPPRDWKGIHILLWEEH 

FGPIPTGHCVCFKDNNKQNWIDNLELITRAERMRRNSIHRYPPELKSAIRVISKLKRTIQEVEHEEQD 

RLlll 

DNA sequence: (SEQ ID NO: 90) 

ATGGACAAGCAAAAAGTCCTCGCCAAGGTCGAGAAGCTGATGGCCCTGGCGAATGCCAAGGGGGCCACGCCGAACGAGGCGGA 
AACCGCATTGCGCCAGGCCGCGATCCTGAAGCGGCAGTTCGACCTCAGCGATGCGGAGATCTCGGCCCACACGGTGGAAACCG 
CGTGCGTTCCCACTCGAACCAGGCGCTCTCCTGCCCCATGGCTGCATGAACTGGCCGGGATCTGCGCCAGTTCCTTCGGCTGC 
GACTACCTGGCGGCATACGCGATGCCAGCGGGCTGGACGTTCAAGTTCATGGGCCGAGGGATCGGCCCTGAGCTGGCCGCTCA 
CGCCTACTCTACGCTCCACCACCAACTGGTGGCAGCGCGCTCGGCTCATGTCGCCCAACAGAAGCGCTGCAAGCTGTCGACCA 
AGCGTCGTCGCAGCAAGCTCTTCGTCGAAGGCTGGCTTCTCGCAGTGCGTTCGCTGGTACGTGAATTTGCTGGCAGGCCGGAC 
GAGTCGACTCAAGCAGCCATCAA<3GCCTACCTCGAACTACACCATCCGGCGTTGAAGTACCTGGAGCCGGCGGCGCTTACGAA 
GGCCCTTGCCTATGACCAGGCCTCGCTGCAAGCAGGCTGGGAGCACGGCAAAAACACTCGCCTGCACCGCGGTGTCAGCCGGC 
GAGTTCAGGGCGCGCTCGAGCAGGGAGGTTCCCAATGA 

Protein sequence: (SEQ ID NO: 211) 

MDKQKVLAKVEKIiMALANAKGATPNEAETALRQAAILKRQFDLSDAEISAHTVETACVPTRTRRSPAPWIJ^ 

DYLAAYAMPAGWTFKFMGRGIGPELAAHAYSTLHHQLVAARSAHVAQQKRCKLSTKRRRSKLFVEGWLLAVRSLVREFAGRPD 

ESTQAAIKAYLELHHPALKYLEPAALTKALAYDQASLQAGWEHGKNTRLHRGVSRRVQGALEQGGSQ 
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RL112 

DNA sequence: (SEQ ID NO: 91) 

ATGAGTGACCCCAAGCTCAAGCCCTGCCCGCTCTGCGGCAGCACGAACATTCGAATGCTGGAACCCGAGCTGCTCGACACC 
GATGCCTGGAACTGTGCCATTGAATGCCTGGACTGCCAGGTTCACATCGGGCCGTCCTACTGCGAGCCAGACCCGGTAACA 
GCGAGGTATTCAGCACAGATCGACTGGAATAGACGCCCAAGCGCAAAAAACCACGCGGACGAGCGTGAGCAGTTCTTGATG 
GCCAACCTGCTCGCCGCCCTGGAGGTCGCACTGGGCGACGTAGCAGCCCTGGCTATTGTCGATCGGGTAAGACAGGCCACA 
GACCGAATTTACCCAACTTCGAACCTCTCCCCTGTTCCGCAGGCCTGGCTCGATGTACAGGCCGAGCGCCGGCGCCAGATC 
ACCGTCGAAGGTTTCGATACCAGCAACGACGACGCTAGCGCTGGCCTGATCGCCCTGGCGGCCGGCTGCTACGCGCTCCAT 
GCCGGCGGCATCGGCACCGACTGGCCGGGCGGCATTCGGAATGGCTCTGCACTGTTCTGGCCCTGGGACGAAGAGTGGTGG 
AAGCCTAAGTCGGCGCGCGAGAACCTGGTACGCGCCGGCGCCCTAGTGCTGGCCGAGATCGAGCGCCTGGACCGCTCCGCC 
ACCGAGCAGGGCTCTUVCCATCTGCAAGGGGGGCGCGTAA 

Protein sequence: (SEQ ID NO: 212) 

MSDPKIjKPCPLCGSTNIRMLEPELLDTDAWNCAIECLDCQVHIGPSYCEPDPVTARYSAQIDWNRRPSAKNHADEREQFIJ^ 

anllaalevalgdvaalaivdrvrqatdriyptsnlspvpqawldvqaerrrqitvegfdtsnddasaglialaagcyalh 
aggigtdwpggirngsalfwpwdeewwkpksarenlvragalvlaeierldrsateqgstickgga 

RL113 

DNA sequence: (SEQ ID NO: 92) 

atgaacctccagaaccgcaacaacctcctactgagcttgatcgccgagacccagttcgacgcctacgtgcaaggctacatg 

gccaaagcaggcgctgccgccggtgcttccgagaatctgcaaatcgaggctgaaggtgcrrgcgatgttgcagggcctggtc 

gctccggttcgcgctcagcagcgtgcctgtggacagtccctgcagaacgcactgctccaaatcgcccacgacctactgttg 

cagacgaaatcgcaactggccatcgcggccaatgccagttcgatccaagtgatccagcgggacatgaacagggc^ 

aacatagctactgccatcgatcacctggccgagttcgcccaaccctcgcaggacactgtgagggtcatcgaacggctc3atg 

ctcttcgtcggcagctcatcaagcactgaaggccagcaactggccgccgaggcaaatgcggtgctcggcatgagcgtggga 

ggcctggcatga 

Protein sequence: (SEQ ID NO: 213) 

l^LQ^^lNNLLLSLIAETQFDAYVQGYMAKAGAAAGASENLQIEAEGAAMLQGLVAPVRAQQRACGQSLQN^ 

QTKSQLAIAANASSIQVIQRD^ttJRAIWNIATAIDHLAEFAQPSQDTVRVIERLMLFVGSSSSTEGQQLAAEANAVLGMSVG 

GLA 

RL114 

DNA sequence: (SEQ ID NO: 93) 

CTGAACAAGTTCGGCAGCGCCGCCGACCTTCGGAGCCAGCAGGCCAAATTGACCGGCGCTACGCGAGAAATACGCAAGCTG 

ACTGGTGGCGGTATCGACCTGTTCGGGAAGCTGGGTTGCTACTTGAGCTTCGAACTW^AAGCAGCTCCTA^^ 

CGCTTGCTCGACTCGGTGAACAAGCAGATCGAGCATGCGAAGGAAAAGCGTGATCGCTACGAGAAA/^AAGCCAAGAAGCGG 

CGCGAGCTACGTGAGCGCCTGGCCAAGCAACTGGTCGCCTCGAACTACCCGCTTCCGGGAAATACGCTCGAAGATCGGCTG 

GAAATCCTGCAGATCGCGTTGATCTACAACCGGGCCyVGGGTGTTCGATCACCTGTACTCCACGCACCAGCTCCACTCAAAA 

CTCAAACGCTGGCTGGAGCGTCCAAAGCAGCTCATCGGATGGCGCAGTGAAGCCGAGTATTTCGCTAGTCAGGTGGGGAGC 

CTGCGATGTGACTTCATTAGCCATCTGACTAACGAAATCGCGTACGACGATGGCAGTGAAGTCGAGGAGCGCCTGCGCGTC 

ATCAAGCAGAAGGTCGCTGACTGCACCGCACAGATCGCTCTGACCAGCGAGGAGCAGGAAACCCTTCGGCTCTGGACAGAC 

GCTCTGCAATCGGCTCCGGAGGGCCTCATATGA 

Protein sequence: (SEQ ID NO: 214) 

LNKFGSAADLRSQQAKLTGATREIRKLTGGGIDLFGKLGCYLSFEQKQLLQDAARLLDSVNKQIEHAKEKRDRYEKKAKKR 
RELRERLAKQLVASNYPLPGNTLEDRLEILQIALIYNRARVFDHLYSTHQLHSKLKRWLERPKQLIGWRSEAEYFASQVGS 
LRCDFXSHLTNEIAYDDGSEVEERLRVIKQKVADCTAQIALTSEEQETLRLWTDALQSAPEGLI 
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FIG. 30V 

RL115 

DNA sequence I (SBQ ID NOx 94) 

ATGAATGCGAAAGCGACTTCGGTTGTATCCACCAAGGGTGGTGTAGGAAAATCCACCACCGCCGCCAACCTCGGTGCATTTTG 
CGCCGATGCAGGCATACGAACCCTCCTCATCGATCTGGACCCCGTCCAGCCCTCCCTATCCTCGTACTACGAGCTGCCGGAAG 
TTGCCCAGGGCGGCATTTACGACCTGCTCGCCGCCAACATAACGGACCCGGCGAGGATCATCTCCAGGACGATTATCCCCAAT 
CTGGACGTCGTGATTTCCAACGACCAGAACAATCAGCTCAACAACCTACTGCTCCAGGCGCCCGATGGCCGGCTACGCCTGGC 
GT^CCTGATGCCCGCTCTGAAAGAAGGCTACGACCTGGTGCTGATCGACACCCAGGGTGCGCGCTCAGCTTTGCTCGAAATGG 
TTGTGCTTGCATCGGACCTGGTTGTTTCCCCCCTCCAACCCAACATGCTTACCGCCCGTGAGTTCAACCGCGGCACCATGCAA 
ATGCTCGACGGCCTACGCCCCTATGAGCGTCTCGGCATGCGGATCCCCAATGTTCAGATCGTCATCAACTGCCTGGACCAGAC 
CAATGACTCCCGGGCAATTCACGAGAATGTGCGTGCCATCTTCGATGAGCATCAGGACATTTCTGTGCTCGAAACGACTGTCC 
CGGATGCCGTCGTGTTTCGCAACGCAGCATCGCGCGGGCTACCAGCGCACCGCCTCGAAACGCGGCAACCCTCCAATCGCACA 
TCAGCGCCCGCGCTGGAAATCATTCGAAACCTGGCCATCGAGGTCTTTCCCGAGTGGACTGACCGCTTCCTGGCGCTGACGCC 
GGGAGGCGGTTGCAGCACTGGTCAAGGGAGGGCGCTGACATGGCGAAGACTCCTATCACCCAAGCCCGCGACGTCGACGCGGA 
ACTTGTGCTGGAACTGA 

Protein sequences (SEQ ID NOs 215) 

MNAKATS WSTKGGVGKSTTAANLGAFCADAGIRTLL IDLDPVQPSLSS YYELPEVAQGGI YDLLAANITDPARIISRTII PN 
LDWI SNDQNNQIJJJNLLLQAPDGRLRLANLMPALKEGYDLVLIDTQGARSALLEIWVLASDL PLQPNMLTAREFNRGTMQ 
MLDGLRPYERLGMRIPNVQIVINCLDQTNDSRAIHENVRAIFDEHQDISVLETTVPDAWFRNAASRGLPAHRLETRQPSNRT 
SAPALEIIRNLAIEVFPEWTDRFLALTPGGGCSTGQGRALTWRRLLSPKPATSTRNLCWN 
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RSOl 

DNA sequence: (SEQ ID NO: 95) 

ATGGGGATCTACCGCCCGAAGCAGTCTCGCCTAGCGATACCGATACTGAGGGGCCGGCTACCGGACGAAAGGTAGCTGCG 
CCTCCCAGCAGTTCGCTAGGCCTGTAGGAAAAATCTGGAATTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCA 
GAGCCCCGCAATTTCAAGGCCGAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGAGGGCACATTGCTACGCCTGCA 
GAATGGTTTCAGGGCCTGAAAAACAGAAAAGCCCACCTAAATAGGCGGGCTATTCCATATTGACATCACGTCAATGCGGG 
CC 

RS02 

DNA sequence: (SEQ ID NO: 96) 

ATGACGCCGCAGCAGCTCACCGAGGAGTACATCTTCGCGCACGATCTCCGAGAAGCCAGCGCGAAGATCTACCGCGCCGC 
GACCAAGGCGCTGCTCAAGCACTTCGGCCCTACGGCAACCGTACAGGAGGTGGACCACAGGTCTGTGCTGGGATGGCGGC 
GCAAGGTCCTGGAACAAGGCCTGTCGAAGCGGAGCTGGAACACGTATTCGAATCATCTGCGAACGATCTGGGGCTATGCC 
ATCGAGCACGAGCTGGTGACACACTCCCAAGTCAACCCGTTCAGAAAGACCACCGTCATCCCCCCCAGGCGAGCAAGCAA 
AACCGTCGCAGCCGAAGCCATCCTGCGCGCCCGCAATTGGCTCAACATGCAGGTCGGCGCCGAGCGCTGCACTGGCGATC 
GCGCACGCATCACTCCCGCCTGGTTCTGGCTTTGCACGTTTGAGGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTG 
TTGTGCATCCGCAAGCGCGACATCGACTGGGAAAATCAACTGATCCTCATCCGCGGCGAGACAGAGAAAACTCATAAAGA 
GTTCGTAGTGCCAATAACGGAGGGGCTTGTGCCTCACCTATCGCGGCTCCTGCAGGAGGCCGATAGAGCCGGATTCGCCG 
ATGACGACCAGTTGTTCAACGTCAACCGGTTCTCACCGCACTACAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCC 
ATGTACCGGAAGTTGACCGAGAAGGTTGGGGTGCGGATGACTCCGCACCGTTTCCGGCACACCCTGGCCACCGACTTGAT 
GAAGGCACCCGAGCGGAACATCCACCTCACGAAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCG 
AGGCCGACTACGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGTCAGGAAGGTG 
GATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCAACCTCTCGCTCGAATGGGTGAAGCGCCGCC 
ACAGGAGGCTAGGACAGAACCTGCAGAACCAAGGGAGCACACACCAGGGACAGGCATTCAGGGAGATGCAACCGCGTGGG 
AAGAAGCGCTACCACAGCCACCTGACACCTTCGAGCAAAGCGTGCTGTTCACTCTGATGGCTCAACACCTATCGAACCGT 
GCCGCCACGGCCTCCGCGGCTTCCACCGCAACAAGCGGATCTGGAGGATGGGGATCTACCGCCCGAAGCAGTCTCGCCTA 
G 

Protein sec[uence: (SEQ ID NO: 216) 

MTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQEVDHRSVLGWRRKVLEQGLSKRSWNTYSNHLRTIWGYA 

lEHELVTHSQWPFRKTTVIPPRRASKWAAEAILRARNWLNMQVGAERCTGDRARITPAWFWLCTFEVFYFTGIRLNAL 

LCIRKRDIDWENQLILIRGETEKTHKEFWPITEGLVPHLSRLLQEADRAGFADDDQLFNWRFSPHYKSKVMNSDQVEA 

MYRKLTEKVGVRMTPHRFRHTLATDLMKAPERNIHLTKCLLNHSNIQTTMSYIEADYDHMRAVLHARSLAQC^ 

DYSGSPQASAKPKPCGQPLARMGEAPPQET^TEPAEPREHTPGTGIQGDATAWEEALPQPPDTFEQSVLFTLMAQHLSNR 

AATASAASTATSGSGGWGSTARSSLA . 

RS03 

DNA sequence: (SEQ ID NO: 97) 

ATGAAATCTGGTATCGCGACCCGTCGCCTCTTCATCAACGACACCAAGGCTTTGGTGCATACCGTCGACGGGACCGCCAT 
GCTGGTCACGCCAGGAATCTTCAAGCGTTATGTCCAGGAGCATCCGGAGGTTGAAAAGCTGGCCCA(3GCCAAGGAGACCG 
CCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAGAAACAGGGTCTTCACCGAAAGACCAGTAAGAACCTGAATATCTGGACC 
ATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGTGCA 
GCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Protein sequence: (SEQ ID NO: 217) 

MKSGIATRRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPEVEKLAQAKETAGWKLVQRAFEKQGLHRKTSI^ 
IKVSGPRKTKELKAYLLQDPKLLFPVQPLDNPSLTVITDAEGGVE 
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RS04 

DNA sequence: (SEQ ID NO: 98) 

ATAGACCAGTTGAGTGAGCAGGAGTCGGTGGAAGTGGTCTGCTCAGCTTTCGATGTGGCGCGGTCTTGCTACTACGTCCA 
CCGTCTTCGACGGCGGCGTGTCGATGCTCGCCGCGTGGCGCTACGCAGCCAAGTCAACCAGTTGTTCAGCCAGAGTCGGG 
GCTCGGCCGGCAGCCGCAGCATTCTGGGCATGCTGCGCGAAGAGGGCGTGACCATCGGCCGTTTCCGAGTGCGTCGGTTG 
ATGCGTGAGCTGGGCCTGGTCAGCAAGCAACCGGGCTCGCACGCCTACAAACAGGCCACGGTTGAGCGGCCGGATATCCC 
GAATCGGCTGAACCGCGAATTCGCGACCGAGCATCCCATACAGGTGTGGTGTGGCGACATCACCTACGTCTGGGCGCAAG 
GCCGTTGGCACTACCTGGCCGCGGTGCTGGATCTGCTGATCGGCTGGGCGTTCTCGGCCAAGCCGGATGCCGAACTGGTG 
ATCAAGGCCCTGGACATGGCCTACGAACAGCGCGGCAGGCCACAGCAGGTGCTGTTCCATTCAGACCAGGGCAGCCAGTA 
CGCCAGCCGCCTGTTTCGGCAACGGCTCTGGCGCTATCGGATGCAGCAGAGCATGAGCCGTCGGGGGAATTGCTGGGATA 
ACTCGCCGATGGAGCGCCTGTTCCGCAGTCTGAAGTCGGAGTGGGTCCCGTCAACGGGTTACCTGACGGCGCAGGAGGCC 
CAACGGGACATCAGTCATTACTTGATGCACCGCTACAACTGGATCAGGCCGCATCAATTCAACGACGGGTTACCACCTGC 
GGTGGCCGAAGAAAAACTCAACCCACTGTCCGGGATGGGTTGA 

Protein sequence: (SEQ ID NO: 218) 

IDQLSEQESVEWCSAFDVARSCYYVHRLRRRRVDARRVALRSQVNQLFSQSRGSAGSRSILGMLREEGVTIGRFRVRRL 
MRELGLVSKQPGSHAYKQATVERPDIPNRLJ^REFATEHPIQWCGDITYWAQGRWHYLAAVLDLLIGWAFSAKPDAELV 
IKALDMAYEQRGRPQQVLFHSDQGSQYASRLFRQRLWRYRMQQSMSRRGNCWDNSPMERLFRSLKSEWVPSTGYLTAQEA 
QRDISHYLMHRYNWIRPHQFNDGLPPAVAEEKLNPLSGMG 

RS05 

DNA sequence: (SEQ ID NO: 282) 

ATGAGCAAGCAACGACGTACGTTTTCCGCCGAGTTCAAACGAGAGGCCGCGGCCCTGGTGTTGGACCT^GGCTACAGCCA 
TATCGACGCCTGCCGTTCGCTGGGGGTGGTGGATTCGGCCTTGCGCCGTTGGGTGAAGCAGCTCGAGGCGGAGCGCCAGG 
GTGTGACCCCGAAGAGCAAGGCGTTGACGCCTGAGCAGCAAAAGATCCAGGAGCTGGAAGCCCGGATCAACCGATTGGAG 
CGGGAGAAAGCGATATTAAAAAAGGCTACCGCTCTCTTGATGTCGGACGAACTCGATCGTACGCGCTGA 

Protein sequence: (SEQ ID NO: 219) 

MSKQRRTFSAEFKREAAALVLDQGYSHIDACRSLGVVDSALRRWVKQLEAERQGVTPKSKALTPEQQKIQELEARINRLE 
REKAILKKATALLMSDELDRTR 

RS06 

DNA sequence: (SEQ ID NO: 99) 

ATGTTGTATTTTTCTTGCAGTATGAAGATGGGTGGTTGGGTCGGATATAGGTACTTCTCTCTATTTTCTTTAATTGCTCT 
CATCTATGGGTGTGTCGGTGGTGGAGGTGGATCGGATGAGATTGGGCAGCACTGCTTTGAGAGAGAGCAAAAGCTTTCCG 
GAGTTAATGATAATGAAGAGGGGAGTGTGAGGTTGAATCGGCTGAACTGCGATCCT^TTGAAGGTCGTGTTCTTGAATCA 
GAGAAGCTGATAAGAAAGCCGCCCAATGAGCTGGGTATTCACTGA 

Protein sequence: (SEQ ID NO: 220) 

MLYFSCSMKMGGWVGYRYFSLFSLIALIYGCVGGGGGSDEIGQHCFEREQKLSGVNDNEEGSVRLNRLNCDPIEGRVLES 
EKLIRKPPNELGIH 
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RS07 

DNA sequence: (SEQ ID NO: 100) 

ATGAAAAAATCACTTGTTATGTCGGCTGTGCTTTTGGTGGCTAGCAATTTCGCGTGTGCTGATGAGGGCTCAAATGA 
TGGAAGTGAGATATGTCGGGCGCAGGGTGGAGTTGAAATAACAAGTCTGGGGGAAGTCTCAAAGGGTGTGGATGTTG 
AAGATGTTGTAGTTTGTTCGATTCTTCCAAGTAATATGAAGTCGAGTCAAAGAGCGCCTACACTCCCTCCTCTGCAA 
AGGATGATCATTTCGGCAATGCCTTCACCAGGAACGGTCACTGTTTCTGCCAGCGGAGATAGGAAATTTACAACATC 
TTGCCGGGCAAATCTTTATGCTCCACGTTATGCCAATTTCTATCCAGACGGTGTTAGCAGGGGAACATCAGATCTAC 
GATGTGTTGGTTACAATACACCCGGGAATTCATCTCAAGGGTGTAATGTGTCATGGGACGGCCCGACCGACATTCAA 
TTGGGTGTTGAGCCATATGGCGGATCTGTTGTTGTAAACTACAGTTGCACTGCATTCAAAACAACGATTCCAGTGAT 
AATGAGCTACAGTTATCGTGATGGGCGGGCAGTGTATGGCGAGGTCCAGAATGTGTCAGGAATAATAAATGTGGTTT 
TGAACTAA 

Protein sequence: (SEQ ID NO: 221) 

MKKSLVMSAVLLVASNFACADEGSNDGSEICRAQGGVEITSLGEVSKGVDVEDVWCSILPSNMKSSQRAPTLPPLQ 
RMIISAMPSPGTVTVSASGDRKFTTSCRANLYAPRYANFYPDGVSRGTSDLRCVGYNTPGNSSQGCNVSWDGPTDIQ 
LGVEPYGGSVWNYSCTAFKTTI PVIMS YS YRDGRAVYGEVQNVSGI INWLN 

RS08 

DNA sequence: (SEQ ID NO: 101) 

ATGCTTATT7U\AATTCTTCGAATTATATTCTTGTTGCCTATAGTTGGTTTGGCACAGCAGGCTGCTGCCTCCCCGCC 
CGCAGAGTCACACTCGGAACAATCTGAATCTTCGTGTATCGATGTCCAAGTCAATGGAGCACGTAGCCTGTCTTATA 
ACTGCATGGCTCAGCAAATGACTCCACCCAAAGAGGATCCTCGGCGTCGGAACCCTACCTTGAACTCCACATTAGCG 
TCTGAACGCGCCACTCGCCTGCCACCCACACAGACA6GACTTTTTACCAGCCTTCATCAACGTGCCATATCGAACTC 
GAAAGACTAG 

Protein sequence: (SEQ ID NO: 222) 

MLIKILRIIFLLPIVGIAQQAAASPPAESHSEQSESSCIDVQWGARSLSYNCMAQQMTPPKEDPRRRNPTLNSTLA 
SERATRLPPTQTGLFTSLHQRAISNSKD 

RS0 9 

DNA sequence: (SEQ ID NO: 102) 

GTGAGTAGTACTAAGAGTAAGCCGATAGCCAGGGGGCGTGGTGGCCCATTTGGGGAAGTGATGAAGAGGTGCGGGCT 
TGTACCGGTTCGAGGAAGGAATAGACAGCAGACAGGATCGCTTGCGATGGGGCAGCAGGAAACCATCAGCCCGTCCG 
TATCCAGAACTGCTGCTTGCAGCGTTAGGGGT<3ACTCCCTCATGCCCTAG 

Protein sequence: (SEQ ID NO: 223) 

VSSTKSKPIARGRGGPFGEVMKRCGLVPVRGRNRQQTGSLAMGQQETISPSVSRTAACSVRGDSLMP 
RSIO 

DNA sequence: (SEQ ID NO: 103) 

ATGGAACGCTTGCTCGAGAGCATTTACATCAATGCCCGGCCGGCGATGGAGTTGAGGCTTAGCCTCACCAGCTCCGG 
CCGCAAGAGAATGGTAAAGATTGTGGATGGGGAGGAGGTCGAGGTTCTGCCAGGTGAAGTGCAGGGCATCCTGGAGG 
CCCAAAAGAGGGATGTTGG7VATCCTCGCCGACTTCTTAGCCAAGAGTCTCGTGGCGCGACGCTAG 

Protein sequence: (SEQ ID NO: 224) 

MERLLESIYINARPAMELRLSLTSSGRKRMVKIVDGEEVEVLPGEVQGILEAQKRDVGILADFLAKSLVARR 
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RSOll 

DNA sequence: (SEQ ID NO: 104) 

ATGGAATGCCACGTTCGTCCCGCCACGAGCAGAGATGCAGCAGCGATAAGCTGCGTAGTTATAGCCGCCCTGCGTGA 
GTCAAATTCACAGGACTATCCGCCTGATGTGATCGCTCAGGTTGAGCAGAGCTTTTCTCCTGAAGCCATCACCACAC 

AGCTTACGAAGCGTAGGGTCTTCGTAGCCTTATTGGGCGAAAACATTATTGGCACTGCCGGTCTCGACGGTGACGTC 
GTCAGAAGTGTTTTCGTTGACCCAGCTCACCAGAAAGGCGGTATCGGGCGGCATTTGATGGATGTCATTCATACAAC 
TGCTGCCAGCGCGGGAGTTGGAGCTGTACGTGTGCCATCGTCGATTACAGCTGAAAGGTTTTATACCGCATTGGGTT 
ATCAGAAAATCCGCGACGAGTTTCATGGGGCGGAGCGCACCATCGTTATGGAGAAGCGGCTGTAG 

Protein sequence: (SEQ ID NO: 225) 

MECHVRPATSRDAAAISCWIAALRESNSQDYPPDVIAQVEQSFSPEAITTQLTKRRVFVALLGENIIGTAGLDGDV 
VRSVFVDPAHQKGGIGRHLMDVIHTTAASAGVGAVRVPSSITAERFYTALGYQKIRDEFHGAERTIVMEKRL 

RS12 

DNA sequence: (SEQ ID NO: 105) 

TTGTGGTTGACCTGCACGCCACAGCAGGATGTGCAGGCGGCGTTAGCTACAGCGTCGATACTCCTGGGCCAGTTCC7V 
CCAGTTGGGCGTGCAGCTCGGTCGGTACACTAGCCTCGACCCGCTTGAGGAAGTCGAGAAGAACGCTTCTGCACTGC 
CGTCTCCTGCTTGGAAAACGGATTCTACTAAGTTCAGCGTGGTACTGAAATCGGGGGGCAGGTCAATCGACAAA&GT 
ATCCCGACGGCAGGTTTGTTGGCCCACGTGATGGTGGCCAAGTTTGCCGATCACTTGCCGCTGTACCGGCAGGAGAA 
AATCTTTGGCCGCGCCGGGCTGGCAATTGCTCGCTCGACCCTGGCGCAGTGGGTCGGACAAACCGGCGTGCGGCTTC 
AGCCACTGGTCGATGCACTGCGTGAAGCCGTGCTGAACCAGGGCGTGATCCACGCTGATGAAACACCGGTGCAAATG 
CTTGCGCCAGGCGAGAAGAAAACCCACCGGGCCTATGTCTGGGCGTACAGCACGACGCCGTTTTCAGGGCTCAAAGC 
GGTGGTTTACGACTTCAGCCCAAGCCGTGCTGGCGAACATGCGCGCAACTTCCTGGGTGACTGGAACGGCAAGCTGG 
TCTGCGACGACTTCGCTGGCTACAAAGCCGGTTTCGAACAAGGCATCACTGAAATCGGCTGCATGGCCCACGCCCGG 
CGCAAGTTCTTTGATTTGCACGTGGCGAACAAAAGTCAGCTGGCTGAACAGGCCCTGCACTCGATCAGCGGCTTGTA 
CGAGGTCGAACGTCAGGCGCGGGACATGAGTGATGAAGAGCGCTGGCGAATACGACAAGAATTGGCGGTGCCGATCC 
TCAAAAAACTGCATGACTGGATGTTGGCTCAGCGAGACCTGGTGCCCAATGGATCAGCCACGGCCAAAGCCCTCGAT 
TACAGCCTGAAACGCTGGGTAGCGCTGACGCGCTACCTGGACGATGGGGCTGTGCCCATCGATAACAATCAGGTCGA 
GAACCAAATACGGCCATGGGCGCTCGGGCGTTCGAACTGGCTGTTTGCCGGGTCGCTGCGCAGTGGTAAACGGGCGG 
CTGCAATCATGAGCCTGATCTAG 

Protein sequence: (SEQ ID NO: 226) 

LWLTCT PQQDVQAALATAS I LLGQFHQLGVQLGRYTSLD PLEEVEKNASALPS PAWKTDS TKFS WLKSGGRS I DKG 
I PTAGLIiAHVMVAKFADHLPLYRQEKI FGRAGLAI ARSTLAQWVGQTGVRLQPLVDALREAVLNQGVI H7U)ETPVQM 
LAPGEKKTHRAYWAYSTTPFSGLKAVVYDFSPSRAGEHARNFLGDWNGKLVCDDFAGYKAGFEQGITEIGCMAHAR 
RKFFDLHVANKSQLAEQALHSISGLYEVERQARDMSDEERWRIRQELAVPILKKLHDWMLAQRDLVPNGSATAKALD 
YSLKRWVALTRYLDDGAVPIDNNQVENQIRPWALGRSNWLFAGSLRSGKRAAAIMSLI 

RS13 

DNA sequence: (SEQ ID NO: 106) 

ATGGTGAGGCGGCGGAGGGTCGCGGTGGCGCGCGAATGCCTGAGCCTGTCGAGCGCACCGAACCAGGTCTTGTCGAT 
GGATTTCGTCTTCGACGCGCTCAGCACTGGGCGACGGATCAAATGCCTGACGGTGGTCGATGACTTCACCAAGGTGT 
CGGTCGACATCTTGGTGGAGTACGGTATCAGCGGTTTTCGTGTCACGCGGGCGCTGGACGAGATGGCGCGGTTTCGT 
GGCTACCCGCAGGCGATCCGCACCGACCAGGGCCCCGAGTTCACCGGCAAGGCGCTTGATCAGTGGGCCTGTCAGCG 
TGACATCAAGTTGAAGCTGATTCAGCCTGGCCAGCCCACGCAGAGCGCCTTCATCGAGTCATTCAACGGCAAGTTCC 
GGGGCGAATGCCTCAATGAGCACTGCTCGCTGGTCGAAGCCAGAATCCGTATCGCGGCTTGGCGGGATTACAACGAG 
CACCGACCACACAGCGCCATTGGCAATCTCTCCCCGGCAGAGCTTGCTGCGAAGTGGCGAACCAACCAGCAGCAGCT 
GAAGCGGGAAAAGTTGATATCAACCCCATAG 

Protein sequence: (SEQ ID NO: 227) 

MVRRRRVAVARECLSLSSAPNQVIiSMDFVFDALSTGRRIKCLTVVDDFTKVSVDILVEYGISGFRVTRALDEMARFR 
GYPQAIRTDQGPEFTGKALDQWACQRDIKLKLIQPGQPTQSAFIESFNGKFRGECLNEHCSLVEARIRIAAWRDYNE 
HRPHSAIGNLSPAELAAKWRTNQQQLKREKLISTP 
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RS14 

DNA sequence: (SEQ ID NO: 107) 

ATGCATATCCAATCGTTGGGGGCTACTGCCTCCTCGCTGAATCAGGAGCCTGTCGAAACCCCGTCGCAGGCAGCGCATAA 
GTCCGCCAGCTTGCGTCAGGAACCTTCAGGGCAAGGTCTCGGGGTTGCCCTAAAGAGCACGCCGGGAATACTTTCCGGGA 
AGTTGCCGGAAAGCGTTAGCGACGTGCGTTTCAGCAGTCCCCAAGGGCAAGGGGAGTCCCGTACTCTGACTGACTCGGCA 
GGGCCGCGGCAGATCACTCTGCGCCAGTTTGAGAACGGAGTCACCGAGCTACAGCTCAGTCGGCCACCATTGACCAGTCT 
GGTCCTAAGCGGCGGTGGTGCCAAAGGTGCGGCATACCCGGGAGCAATGCTGGCGCTAGAAGAGATU^GGCATGCTCGATG 
GCATCCGCAGCATGTCCGGTTCGTCCGCTGGCGGCATCACCGCCGCCCTTTTGGCCTCAGGTATGAGCCCGGCGGCGTTC 
AAGACCCTTTCCGACAAGATGGATCTTATTTCGCTGCTCGACAGCTCGAACAAGAAGCTGAAGCTGTTCCAACACATTAG 
CAGCGAGATCGGCGCATCGCTGAAAAAGGGCTTGGGCAACAAGATCGGCGGCTTCTCTGAGTTGCTGCTCAATGTACTCC 
CACGCATAGATTCGCGGGCTGAGCCCCTAGAACGCCTATTGCGCGACGAGACACGCAAGGCCGTGCTCGGACAGATCGCT 
ACGCATCCAGAGGTTGCACGCCAGCCGACCGTTGCCGCCATCGCCAGCAGATTGCAGTCCGGCTCCGGAGTCACCTTTGG 
CGATCTAGATCGGTTGAGTGCTTACATTCCCCAGATTAAGACGCTGAACATCACAGGTACGGCCATGTTCGAGGGGCGTC 
CGCAATTAGTGGTGTTCAATGCCAGCCACACACCGGATCTGGAGGTCGCCCAGGCGGCACATATCTCCGGTTCCTTCCCA 
GGAGTGTTCCAGAAGGTCAGCTTGAGTGATCAGCCGTACCAGGCCGGCGTAGAGTGGACAGAATTCCAGGATGGCGGGGT 
GATGATTAACGTGCCGGTCCCTGAGATGATCGACAAGAATTTTGACAGCGGGCCACTGCGGCGCAACGACAACCTGATCC 
TTGAGTTCGAGGGCGAAGCTGGGGAGGTAGCGCCCGACCGAGGTACTAGGGGCGGCGCGCTCAAGGGCTGGGTCGTCGGG 
GTGCCTGCCCTGCAGGCGCGCGAAATGCTGCAGCTCGAGGGCCTGGAGGAATTGCGCGAGCAAACCGTTGTGGTGCCGTT 
GAAGAGCGAGCGCGGTGATTTCAGTGGCATGCTCGGTGGCACCTTGAACTTCACCATGCCGGACGAGATCAAGGCGCATC 
TTCAGGAGCGCCTCCAGGAGCGAGTCGGTGAACATCTGGAGAAACGTCTTCAGGCTTCAGAGCGTCATACCTTCGCTTCT 
CTCGACGAGGCGCTGCTGGCACTTGATGACAGTATGCTCACCAGTGTTGCTCAACAGAACCCGGAGATCACAGACGGGGC 
GGTGGCTTTTCGCCAGAAGGCGCGGGATGCGTTCACCGAGCTGACTGTCGCTATCGTTAGCGCCAATGGCTTGGCGGGTA 
GGCTCAAGTTGGACGAGGCTATGCGCTCCGCTCTTCAGCGACTCGATGCGCTGGCAGATACTCCGGAACGCCTAGCATGG 
TTGGCAGCTGAGTTGAACCATGCTGAT7UVCGTTGATCATCAGCAGTTACTCGATGCCATGCGCGGGCAGACGGTGCAGTC 
GCCGGTGCTCGCCGCTGCGTTAGCAGAGGCGCAGCGCCGCAAAGTGGCGGTTATTGCCGAGAACATTCGTAAGGAAGTTA 
TCTTCCCCTCTCTGTATCGCCCTGGCCAGCCGGATTCCAACGTAGCTCTGTTACGTCGGGCGGAGGAGCAGCTACGGCAT 
GCCACCAGTCCGGCGGAAATCAATCAAGCGCTGAACGATATCGTCGACAACTACTCGGCACGAGGCTTCCTGCGTTTCGG 
CAAACCCTTGAGTTCGACTACCGTTGAGATGGCTAAGGCTTGGCGGAATAAGGAGTTCACATGATT 

Protein sequence (SEQ ID NO: 22 8) 

MHIQSLGATASSLNQEPVETPSQAAHKSASLRQEPSGQGLGVALKSTPGILSGKLPESVSDVRFSSPQGQGESRTIiTDSA 
GPRQITLRQFENGVTELQLSRPPLTSLVLSGGGAKGAAYPGAMLALEEKGMLDGIRSMSGSSAGGITAALLASGMSPAAF 
KTLSDKMDLISLLDSSNKKLKLFQHISSEIGASLKKGLGNKIGGFSELLLNVLPRIDSRAEPLERLLRDETRKAVLGQIA 
THPEVARQPTVAAI ASRLQSGSGVTFGDLDRLSAYI PQ I KTLNI TGTAMFEGRPQLWFNASHTPDLEVAQAAHI SGS FP 

GVFQKVSLSDQPYQAGVEWTEFQDGGVMINVPVPEMIDKNFDSGPLRRNDNLILEFEGEAGEVAPDRGTRGGALKGWVVG 
VPALQAREMLQLEGLEELREQTVWPLKSERGDFSGMLGGTLNFTMPDEIKAHLQERLQERVGEHLEKRLQASERHTFAS 
LDEALLALDDSMLTSVAQQNPEITDGAVAFRQKARDAFTELTVAIVSANGLAGRLKLDEAMRSALQRLDALADTPERLAW 
LAAELNHADNVDHQQLLDAMRGQTVQS PVLAAALAEAQRRKVAVI AEN I RKE VI F PS LYRPGQPDSNVALLRRAEEQLRH 
ATSPAEINQALNDIVDNYSARGFLRFGKPLSSTTVEMAKAWRNKEFT 

RS15 

DNA sequence: (SEQ ID NO: 108) 

ATGATTGATACATGGCTGGCACAGTGGGGCTTGAGACTTCCCTCGAGCAACGATGCCACGTTGCGGCTGCAACCGGCAGA 
GGGACCGGAACTGGTTATGGAGCGCCTCGAGGGCGGTTGGCTTTTCGTCGTCGAGTTGGGACTTGTGCCTTCAGGGTTAC 
CGCTGGGTGTGATCTTGCAATTGTTACAAGTGAACTCTCCATTCTCATCCTTGGCACCGGTGAAACTTGCGGCGGACGAT 
GCCGGTAGACTTGTGCTCTGGGCTGAGGCACGTGATGGCGTTGACGATGTGGATGCACTGAACCGCTTGCACGATAGGCT 
GCGGGAAGGACATTCACGATTAGTGCCATTGCTAGAGCCCACGGGTGAGTTGGTTCCAGCTCAGATACAAACCAGCGCGT 
TAGTGTTCGTTTGA 

Protein sequence: (SEQ ID NO: 229) 

MIDTWLAQWGLRLPSSNDATLRLQPAEGPELVMERLEGGWLFWELGLVPSGLPLGVILQLLQWSPFSSIAPVKLAADD 
AGRLVLWAEARDGVDDVDALNRLHDRLREGHSRLVPLLEPTGELVPAQIQTSALVFV 
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FIG. 32 



Big Island: Overall Nucleotide Homology 
Total 84830bp 

Bp# Species, strain, gene name 



Accession # 



Evalue/ %identity 



23559-25465: X. axonopodis pv. Citri strain 306 AE01 1 864 83% 

Only stretches within the 20437-25465bp are homologous to X. axonopodis pv. Citri strain 306. 
A total of 1060bp, not contiguous, from this region are homologous to X. axonopodis. 



33872-38412: 



P. aeruginosa, PA14, pvrR 



AF482691 



0.0; 99% 



40989-46535: P. aeruginosa, PA01 , AE004640 80% 

PA21 28-21 32 

Only stretches within the 40989-46535bp region are homologous to PA01 . A total of 2406bp, not 
contiguous, from this region are homologous to PA01. 

48266-49533: P. putida, plasmid pWWO AJ344068 96% 

Only stretches within the 48266-49533bp are homologous to P. putida, plasmid pWWO. A total of 
780bp, not contiguous, from this region are homologous to P. putida. 

56824-58706: P. syringae pv. maculicola, AF359557 83% 

plasmid pFKN 

Only stretches within the 56824-58706bp are homologous to P. syringae pv. maculicola, plasmid 
pFKN. A total of 1882bp, not contiguous, from this region are homologous to P. syringae. 



64748-64942: 

82447-85179 

85334-855542: 

93200-93317: 

108075-108610: 

100119-101054: 



P. aeruginosa, PA103, exoU, U97065 

P. aeruginosa, PA01 , PA0984-0985 AE004531 

P. aeruginosa, PA1 58 X73064 

P. aeruginosa, SGI 7M, AF28541 6 
plasmid pKLC102 

P. aeruginosa, PA01 , PA3849 AE4802 



1E-85/96% 
0.0; 97% 
3E.80/94% 
7E-50/98% 
0.0/91% 

0.0/98% 
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FIG. 33 



Small Island: Overall Nucleotide Homology 
Total 10848bp 



Bp# 



(Species, strain, gene name 



Accession #) 



Evalue/ %identity 



877-1632: P. aeruginosa. PA0^.PA0977 AE004531 0.0/94% 

877-2603: P. aeruginosa, AF285416 0.0/92% 

SG17M plasmid pKLC102 
2978-6471: P. aerug/nosa, PA01, AE004531 0.0/99% 

PA0978-81 

7035-7999: P. syringae pv. maculicola, AF359557 83% 

plasmid pFKN 

Only stretches within the 7035-7999bp are homologous to P. syringae pv. maculicola, plasmid 
pFKN. A total of 534bp, not contiguous, from this region are homologous to P. syringae. 



7999- 8284: 

8000- 8080: 

8120-8259: 
8272-8860: 
8470-11724: 



P. aeniglnosa, PA103, exoU 

P. aeruginosa, 

PA01 intragenic region 

P. aeruginosa, 

PA01 intragenic region 

P. aeruginosa, 

PA01 intragenic region 

P. aeruginosa, PA103, exoU, 



AF27291 
AE004531 

AE004531 

AE004531 

AF27291 



E-1 36/96% 
E-18/91% 

E-24/85% 

E-176/88% 

0.0/99% 
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FIG. 35 

Alignment: No_2 - embi[AL039136]HSM003612 



Homo sapiens mRNA; EST DKFZp566K094_r1(from clone DKFZp566) 

Q:18 DQTCDNLSQNPPHHLLLRLLDHWGDPAGCWSLGQTYSGHLYLPYCRELHKCSLCAHRNWH SEQ ID NO: 23 0 
DQTCDNLSQNPPHHLLLRLLDHWGDPAGCWSLGQTYSGHLYLPYCRELHKCSLCAHRNWH SEQ ID NO: 231 
H:29 DQTCDNLSQNPPHHLLLRLLDHWGDPAGCWSLGQTYSGHLYLPYCRELHKCSLCAHRNWH SEQ ID NO: 232 

HYCCLWPVWMLCYMSW 93 
HYCCLWPVWMLCYMSW 
HYCCLWPVWMLCYMSW 2 56 
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FIG. 36 

Alignment: No_8 - embl[M79137]HSXT01285 



EST01285 Subtracted Hippocampus, Stratagene (cat.#936205)H 

Q:18 QVQHPPLCLLDQHQQECIPPCLPPDHLQDPQHPFLLPDHHVPHLWLIQPQLCRALAP SEQ ID NO: 233 
75 

QVQHP .CLLDQHQQECIPPCLPPDHLQDPQHPFLLPDHHVPHLWLIQPQLCRALAP SEQ ID NO: 234 
H:43 QVQHPXXCLLDQHQQECIPPCIiPPDHLQDPQHPFLLPDHHVPHLWLIQPQLCRALAP SEQ ID NO: 235 
216 
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FIG. 37 

Alignment: No47-swissnew[p35555]FBN1_HUMAN 



FIBRILLIN 1 PRECURSOR.//:swlss[P35555]FBN1_HUMAN FIBRILLIN1 

PRECURSOR.//:trembl tL13923]HSFIBRLLN_1 product: "fibrillin"; Homo sapiens fibrillin 

mRNA, complete cds.//:gp[L13923]306746 product: "fibrillin"; Homo sapiens fibrillin mRNA.comptete cds. 

Q: 18 CGGASCHNTLGSYKCMCPAGFQYEQFSGGCQDINECGSAQAPCSYGCSNTEGGYLCGCPP SEQ ID NO: 236 

CGGASCHNTLGSYKCMCPAGFQYEQFSGGCQDINECGSAQAPCSYGCSNTEGGYLCGCPP SEQ ID NO: 237 

H:2617 CGGASCHNTLGSYKCMCPAGFQYEQFSGGCQDINECGSAQAPCSYGCSNTEGGYLCGCPP SEQ ID NO: 238 

GYFRIGQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECKINGYPKRGRKRRSTNET SEQ ID NO: 236 

GYFRIGQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECKINGYPKRGRKRRSTNET SEQ ID NO: 237 

GYFRIGQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECKINGYPKRGRKRRSTNET SEQ ID NO: 238 



DASNIEDQSETEANVSLASWDVEKTAIFAFNISHV-NKVRIL 178 SEQ ID NO: 236 
DASNIEDQSETEANVSLASWDVEKTAIFAFNISHV NKVRIL SEQ ID NO: 237 

DASNIEDQSETEANVSLASWDVEKTAIFAFNISHVSNKVRIL 2778 SEQ ID NO:238 
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FIG. 38 



Alignment: No56-trembl[AF088916]AF088916.1 



gene:"EMr;product:"elastin microfibril interfase located protein"; Homo sapiens elastin microfibril 
interfase located protein (EMI) gene, complete cds, 

//itrembi [AF088916]AF088916_1 product: "emilin precursor"; Homo sapiens emilin precursor, 
mRNA, complete cds and 3* UTR.//:gp[AF088916]5353510 product:"emilin precursor"; 
Homo sapiens emilin precursor, mRNA, complete cds and 3'UTR. 

//:gpnew[AF162780]6693840 gene:"EMr; product: "elastin microfibril interfase located protein; Homo 
sapiens elastin microfibril interfase located protein (EMI) gene, complete cds. 



Q: 7 DGDVYNPSTGVFTAPYDGRYLITATLTPERDAYVEAVLSVSNASVAQLHTAGYRREFLEY SEQ ID NO: 23 9 
DG. , YrP.TGVFTAP. GRYL::A.LT .R.. VEAVLS SN . . VA: : . : . GY . E LE SEQ ID NO:240 
H:896 DGGYYDPETGVFTAPLAGRYLLSAVLTGHRHEKVEAVLSRSNQGVARVDSGGYEPEGLE- SEQ ID NO: 241 

HRPPGALHTCGGP-GAFHLIVHLKAGDAV 94 SEQ ID NO: 239 
::P : . . G. G.F.LI: L:AGD.V SEQ ID NO:240 

NKPVAESQPSPGTIiGVFSLILPLQAGDTV 983 SEQ ID NO: 241 

gene:"EMr;product:''elastin microfibril interfase located protein"; Homo Sapiens elastin microfibril 
interfase located protein (EMI) gene, complete cds. 

//:trembl [AF088916]AF088916_1 product: "emilin precursor"; Homo sapiens emilin precursor, 
mRNA, complete cds and 3' UTR.//:gp[AF088916]5353510 product:"emilin precursor" 
Homo sapiens emilin precursor, mRNA, complete cds and 3'UTR. 

//:gpnew[AF162780]6693840 gene:"EMr; product: "elastin microfibril interfase located protein". Homo 
sapiens elastin microfibril interfase located protein (EMI) gene, complete cds 

Q: 7 DGDVYNPSTGVFTAPYDGRYLITATLTPERDAYVEAVLSVSNASVAQLHTAGYRREFIjEY SEQ ID NO: 242 
DG. .Y:P.TGVFTAP. GRYL::A.LT .R.. VEAVLS SN. .VA: : . : .GY. E LE SEQ ID N0:243 
H:896 DGGYYDPETGVFTAPLAGRYLLSAVLTGHRHEKVEAVLSRSNQGVARVDSGGYEPEGLEN SEQ ID NO: 244 

HRPPGALHTCGGPGAFHLIVHLKAGDAV 94 SEQ ID NO: 242 

. : : G. G.F.LI: L:AGD.V SEQ ID NO:243 

KPVAESQPSPGTLGVFSLILPLQAGDTV 983 SEQ ID NO: 244 
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FIG. 39 



Alignment: No59-pIronly[A35763]A35763 



unnamed ORF;P.lividus 2-alpha collagen(COLL2-alpha) mRNA, complete cds. 
//:pironly[A35763]A35763 collagen alpha 2 chain-sea urchin(Paracentrotus lividus) 
{fragment)//:gp[J05422]1 59962 unnamed ORF;P.lividus 2-aIpha collagen(COLL2-alpha) 
mRNA, complete cds. 

Q: 92 GENGSSGSQAPLQGLRGIFGLWGRRSRARFCGPR-PVARLGGGTSAGRELGL 142 SEQ ID NO: 245 
GE G.SG. . .P QG:RGI G: G. . . : . GPR P . GGG S G. . GL SEQ ID NO:246 

H:718 GEPGPSGENGP-QGVRGIPGWGENGKTGRGGPRGPPGLRGGGGSRGERGGL 768 SEQ ID NO: 24 7 



unnamed ORF;P.lividus 2-alpha collagen{COLL2-alpha) mRNA, complete cds. //:pironly[A35763]A35763 
collagen alpha 2 chain-sea urchin(Paracentrotus lividus) (fragment)//:gp[J05422] 159962 unnamed 
ORF;P.lividus 2-alpha collagen(COLL2-alpha) mRNA, complete cds. 

Q: 92 GENGSSGSQAPLQGLRGIFGLWGRRSRARFCGPR-PVARLGGGTSAGRELGL 142 SEQ ID NO: 248 
GE G,SG...P QG:RGI G: G. . . : . GPR P . GGG S.G.. GL SEQ ID NO:249 

H:718 GEPGPSGENGP-QGVRGIPGWGENGKTGRGGPRGPPGLRGGGGSRGERGGL 768 SEQ ID NO: 250 
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FIG. 40 



Alignment: No60/63-swiss[P20062]TCO2_HUMAN 



TRANSCOBALAMIN II PRECURSOR.//:trembl[M60396]HSTCIL1gene:"TCN2-; product: 
"transcobalamin IT; Human transcobalamin II (TCII) mRNA, complete cds. 

//:gp[M60396]339196 gene: "TCN2"; product: "transcobalamin 11"; Human transcobalamin 11 (TCII) 
mRNA, complete cds. 

Q: 8 VEPFHQGHHSVDTAAl^GLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHFGN SEQ ID NO: 251 
VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHFGN SEQ ID NO: 252 
H:183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHFGN SEQ ID NO: 253 



VYSTPLALQFLMTSPMPGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKTYI SEQ ID NO: 251 
VYSTPLALQFLMTSPMPGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKTYI SEQ ID NO: 252 
VYSTPLALQFLMTSPMPGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKTYI SEQ ID NO: 253 



DL I FPDCLAPRVMLE PAA 
DLIFPDCLAPRVMLEPAA 
DL I FPDCLAPRVMLE PAA 



145 SEQ ID NO: 251 
SEQ ID NO: 2 52 
320 SEQ ID NO: 253 



TRANSCOBALAMIN II PRECURSOR.//:trembl[[\/l60396]HSTCIMgene:"TCN2''; product: 
"transcobalamin H"; Human transcobalamin II (TCII) mRNA, complete cds. 
//:gp[M60396]339196 gene: "TCN2"; product: "transcobalamin 11"; Human transcobalamin II 
(TCII) mRNA, complete cds. 

Q: 8 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA 47 SEQ ID NO: 254 
VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA SEQ ID NO: 255 

H:183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA 222 SEQ ID NO: 256 



TRANSCOBALAMIN II PRECURSOR.//:trembl[M60396]HSTCIMgene:"TCN2";product: 
"transcobalamin IT; Human transcobalamin II (TCII) mRNA, complete cds. 
//:gp[M60396]339196 gene: "TCN2"; product: "transcobalamin 11"; Human transcobalamin II 
(TCII) mRNA, complete cds. 

Q: 8 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA 47 SEQ ID NO: 257 

VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA SEQ ID NO: 258 

H:183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMA 222 SEQ ID NO: 259 
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FIG. 41 



Alignment: No65-swissnew[P23142]FBL1_HUMAN 



FlBULIN-1 PRECURSOR.//:swiss[P37888]FBLD_HUMAN FIBULIN-1, ISOFORM D 
PRECURSOR.//:trembl[U01244]HS2444_1 product: "fibulin-1D": Homo sapiens fibulin-1D 
mRNA, complete cds.//:gp[U01244]1621019 product: Tibulin-1D"; Homo sapiens fibulin-1D 
mRNA, complete cds. 

Q: 18 RNCQDIDECVTGIHNCSINETCFNIQGGFRCLAFECPENYRRSAATLQQEKTDTVRCIKS SEQ ID NO: 260 
RNCQDIDECVTGIHNCSINETCFNIQG.FRCLAFECPENYRRSAATLQQEKTDTVRCIKS SEQ ID NO: 261 
H:521 RNCQDIDECVTGIHNCSINETCFNIQGAFRCLAFECPENYRRSAATLQQEKTDTVRCIKS SEQ ID NO: 262 

CRPNDVTCVFDPVHTISHTVISLPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGN SEQ ID NO: 260 
CRPNDVTCVFDPVHTISHTVISIiPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGN SEQ ID NO: 261 
CRPNDVTCVFDPVHTISHTVISLPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGN SEQ ID NO: 262 



LRDSFDIIKRYMDGMTVGIRR 158 SEQ ID NO: 260 
LRDSFDIIKRYMDGMTVG: R SEQ ID NO: 261 

LRDSFDIIKRYMDGMTVGWR 661 SEQ ID NO: 262 
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FIG. 42 

Alignment: 80 - trembl[AF045447]AF045447_1 



gene: "DPC4"; product:"deleted in pancreatic carcinoma"; Homo sapiens deleted in 
pancreatic carcinoma (DPC4) gene, exon 1 1 partial sequence and complete cds. 
//:trebl[U44378]HS443781_1 gene: "DPC4"; product: "Dpc4"; Human homozygous deletion 
target in pancreatic carcinoma (DPC4) mRNA, complete cds. //:pironly[S71811]S71811 
probable transcription regulator MAD-4-human//:gp[AF045447]2865657 gene:"DPC4"; 
product: "deleted in pancreatic carcinoma"; Homo sapiens deleted in pancreatic carcinoma 
(DPC4) gene, exon 1 1 partial sequence and complete cds. //:gp[U44378]1 163234 gene: 
"DPC4"; product: "Dpc4"; Human homozygous deletion target in pancreatic carcinoma 
(DPC4) mRNA, complete cds. 

Q: 6 PGSRIRGRVDTLQXNAPXXMMVKDEYVHDFEGQPXLXTEGHXIQTIQHPPXNRAXTETYX SEQ ID NO: 2 63 

PG : . G TLQ . NAP . . MMVKDEYVHDFEGQP . L . TEGH . IQTIQHPP . NRA . TETY . SEQ ID NO : 2 64 

H:139 PGIDLSGL--TLQSNAPSSMMVKDEYVHDFEGQPSLSTEGHSIQTIQHPPSNRASTETYS SEQ ID NO: 265 

TPALLAPXEXNATXTANFPNIPVAXTXQPTOCILGGXHXEGLLQIAXGPQPGQQQNGFTGQ SEQ ID NO: 2 63 

TPALLAP.E.NAT.TANFPNIPVA.T.QPA.ILGG.H.EGLLQIA.GPQPGQQQNGFTGQ SEQ ID NO: 2 64 

TPALLAPSESNATSTANFPNIPVASTSQPASILGGSHSEGLLQIASGPQPGQQQNGFTGQ SEQ ID NO: 265 



PATYHHNXTTTWTGXRTAPYTPNIiPHHQKG 155 SEQ ID NO: 2 63 
PATYHHN . TTTWTG . RTAPYTPNI.PHHQ . G SEQ ID NO: 2 64 

PATYHHNSTTTWTGSRTAPYTPNLPHHQNG 286 SEQ ID NO: 265 
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FIG. 43 



Alignment: N086 - trembl[D32210]D32210 .1 



gene: "Notch2"; product:"cell surface protein"; Mus musculus (Notch2) nnRNA, complete cds. 
//:gp[D32210]2373395 gene: "Notch2"; product:"cell surface protein"; Mus musculus 
(Notch2) mRNA, complete cds. 

Q:81 MPAIiRPALLWALLALWLCCATPAHALQCRDGYEPCVNEGMCVTYHNGTGYCKCP-GFLGE SEQ ID NO: 2 66 
MP LRPA.L ALL LWLC A PAHALQCR. G. EPCVNEG. CVTYHNGTG : C : CP GFLGE SEQ ID NO:267 
H: 1 MPDLRPAALRALLWLWLCGAGPAHALQCRGGQEPCVNEGTCVTYHNGTGFCRCPEGFLGE SEQ ID NO: 2 68 

YCQHR-PCEKNRCGDPSTC 157 SEQ ID NO: 266 
YCQHR PCEKNRC : .TC SEQ ID NO: 2 67 

YCQHRDPCEKNRCQNGGTC 79 SEQ ID NO: 268 
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FIG. 44 



AATTCGCGGCCGCGTCGACGACCAAACCTGTGATAACCTGTCTCAAAACCCTCCTCATCATCTACTCCTTCGTCT SEQ ID NO: 109 
I 1 1 1 1 I I I -J I I I [ I I L 75 



adaptor | 



,1 RG. RVDD QTC DN L, S QN.P P HH LL L. R L SEQ ID NO: 269 
1 I I 1 i L I t I I I ' I I 

TCTGGATCACTGGGGTGATCCTGCTGGCTGTTGGAGTCTGGGGCAAACTTACTCTGGGCACCTATATCTCCCTTA 
I I 1 I I I I I I I ■ I ■ I I 150 



LDHWGDPAGCWS LGQTYSGHLYLP Y 

I I I I I I I I I I I I I I L 

TTGCaSAGAACTCCACAAATGCTCCCTATGTGCTCATCGGAACTGGCACCACTATTGTTGTCTTTGGCCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 225 



CRELHKCSLCAHRNWHHYCCLWPVW 

1 I I I I I I I I I I I I I L 

GATGCTTTGCTACATGTCGTGGTAGCCCATGGATGCTGAAACTGTATGCCATGTTTCTGTCCCTGGTGTTCCTGG 
I I I I I I I I I I ■ I I I I 300 



MLCYMSW PMDAETVCHVSVPGVPG 

■ I ■ ^ ■ I ■ ^ » I ■ * f I F 

CTGAGCTCGTAGCTGGCATTTCAGGGTTTGTGTTTCGTCATGAGATCAAGGACACCTTCCTGAGGACTTACACGG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 375 

ARSWHF RVCVSS DQGHLPEDIiHG 
I I I I I I I I , I I I I I I I 



ACGCTATGCAGACTTACAATGGCAATGATGAGAGGAGCCGGGCAGTGGACCATGTGCAGCGCAGCCTGAGCTGCT 
I I i I I I I I I I I I I I L 450 



RYADLQWQ. .EEPGSGPCAAQPELL 
I I I I I I I I I I I ■ i ' 

GTGGTGTGCAGAACTACACCAACTGGAGCACCAGCCCCTACTTCCTGGAGCATGGC 
I I I I 1 I I I I I 506 

WCAELHQLEHQPLIiPGAW 

I I I I I I I I I I 



-N 
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FIG. 45 



AATTCGCGGCCGCGTCGACCAAGTGCAACACCCTCCACTGTGCCTTTTGGACCAGCACCAACAGGAATGTATCCC SEQ ID NO: 110 
I 1 1 1 I I 1 ■ I 1 I I I I I L 75 



adaptor 



IRG. RVDOVQHPPLCLLDQHQQECI P SEQ ID NO: 270 

— I i I I I I I I I I I r I I I. 

TCCGTGCCTCCCACCGGACCACCTCCAGGACCCCCAGCACCCTTTCCTCCITCaSGACa^TCy^TGTCCC 

1 1 1 I 1 1 1 1 I I I I I I L 150 

PCLPPDHLQDPQHPFIiLPDHHVPHIi 

I I I I I I I I I I I I I I L 

GGTGGTCCTTATCOVGCCCCAACTGTGCCGGGCCCTGGCCCCACAGGGCCATATCCTACACCAAATATGCCCTTT 
I I U I I ' I I I I ' ■ ' ■ 225 

VVLIQPQLCRAL.APQGHILHQICPF 
I 1 I I I I I I ■ I ■ I ■ 

CCAGAGCTACCCANACCATATGGTGCACCCACAGATCCAGCTGCAGCTGGTCCTTTAGGTCCATGGGGATCCATG 
1 1 1 1 1 1 1 1 1 1 1 I I I L 300 



QSYP?HMVHPQIQLQLVIi. VHGDPC 

I 1 1 i I I I I I I I I I I L. 

TCTTCTGGACCTTGGGCGCCAGGAATGGGAGGGCAGTATCCTACCCCTAATATGCCATATCCATCTCCAGGCCCA 
1 1 I I I I I I I I I I I I I. 375 

LLDLGRQEWEGSILPIilCHIHLQAH 
I I I I _j 1 I 1 I 1 I I I ' 

TATCCCGNTCCTCCTCCTCCCCy^GCCCCTGGGGCAGCACCACCTGTTCCATGGGGCACCGTTCCACCAGGAGCC 
I I I I I I I I I I I I I I L 450 

IP?LLLPKPLG QHHLFHGAPFHQEP 

I I I \ I I I I I I I I I I L 

TGGGGACCACCAGCACCATATCCTGCCCCTACAGGATCGTATCCCACACCAGGACTCTATCCTACTCCC 

I I I I I I I I ' ■ ' ' » 519 

GDHQHHILPLQDRI PHQDS ILLP 
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FIG. 46 



AATTCGCXSGCCGCGTCGACTGCGGAGGAGCCTCCTGTCACAACyvCCCrGGGGAGCTACyA SEQ ID NO: 111 



J- 



I 



75 



adaptor 



IRGRVDCGGASCHNTLGSYKCMC PA SEQ ID NO: 271 

I I I I I I I I Li I I I I I ■ 

CGGCTTCCAGTATGAACAGTTCAGTGGAGGATGCCAA6ACATCMTGAATGTGGCTCTGCGCAGGCCCCCTGCAG 
1 I I I I I I I I I I I I I L 150 

GFQYEQFSGGCQDINECGSAQAPCS 

CTATGGCTGTTCCAATACCGAGGGCGGTTACCTGTGTGGCTGTCCACCTGGTTACTTCCGCATAGGCCAAGGGCA 
i I I I I I I ' I I I I I L 225 

YGCSNTEGGYIiCGCPPGYFRIGQGH 
1 1 I I I I I I I I I I I i I 

CTGTGTTTCTGGAATGGGCATGGGCCGAGGAAACCCAGAGCCACCTGTCAGTGGTGAAATGGATGAO^ 

I I I I I J I I I I I I I I L 300 

CVSGMGMGRGNPEPPVSGEMDDNSL 

I I I I I I I I I I I I I I L 

CTCCCCAGAGGCTTGTTACGAGTGTAAGATCAATGGCTACCCCAAACGGGGCAGGAAACGGAGAAGCACAAACGA 
1 1 1 1 1 I I I I I I I I I L 375 

SPEACYECKINGYPKRGRKRRSTNE 

. 1 I I 1 I I I I I 1 I I I L 

AACTGATGCCTCCAATATCGAGGATCAGTCTGAGACAGAAGCCAATGTGAGTCTTGCAAGTTGGGATGTTGAGAA 
1 I I I I I 1 I I I I I ■ I ■ 450 

TDASNIEDQSETEANVSLASWDVEK 

1 I I I I I I I I I I I I I L 

GACAGCCATCTTTGCTTTCAATATTTCCCACGTCAGTAACAAGGTTCGAATCCTANAACTCCTT 
I I I I I I ' ■ ' ' ' ^ 514 



TAX FAFNISHVSNKVRIL? LL 
I I I I I I —I I I I I I ► 
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FIG. 47 



AATTCGCGGCCGCGTCGACGGGGATGTTTACAACCCCAGCACCGGGGTCTTCACGGCTCCTTATGATGGGCGCTA SEQ ID NO: 112 
I I I I I I I I I I I I I I I L 75 



adaptor 



IRGRVDGDVYNPSTGVFTAPYDGRY SEQ ID NO: 272 

I I I I I I I I I I [ I I I I L 

CCTGATCACGGCaiCCCTCACCCCCGAGAGAGACGCCrACGTGGAAGCAGTGCTGTCGGTCTCCAACGCCAGCAG 
I I I I I I I I I I I I I I L 150 

lilTATLTPERDAYVEAVLSVSNASS 
I I I I I I I I I I I I I I I 

TGGCCCAGCTGCATACCGCTGGGTACAGGAGAGAGTTCCTGGAATACCACCGCCCTCCAGGAGCTTTGCATACCT 
1 I I I I I I I I I 1 I I I L 225 

GPAAYRWVQERVPGI PPPSRSFAYL 
I I I I I I 1 I I I I I I I I 

GCGGGGGCCCGGGGGCATTCCACCTCATCGTGCACCTGAAGGCGGGAGATGCAGTCCACGTCGTGGTGACTGGGG 
I I I I I I I I I I I I I I L 300 



RGPGGIPPHRAP 
I I I I I I I 



G6RCSQRRGDWG 

I I I I I I L 



GCAAGCTGGCTCACACAGACTTTGATGAAATGTACTCOVCATTTAGTGGGGTTTTCTTATATCCTTTCCTTTCCC 
I I I I I I I I I I I I I I L 375 



Q A G S H R L 
I I I I 



N V L H I 
J I I 



WGFLISFPFP 

I I I I I L 



ACCTCTAAGGTGGCTGGGGAGATGT 
1 1 1 1 ^ 400 

P L R W L G R C 
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FIG. 48 



AATTCGCGGCCGCGTCGACAAAGAAAAAAAGAAAGTTTTCACTCTGGGCTGTGGAACTATTTCAGGACTTCCTGA SEQ ID NO: 113 
I 1 1 1 1 1 t 1 1 1 1 I I I I L 75 



adaptor 



IRGRVDKEKKKVFTLGCGTISGLPE SEQ ID NO: 2 73 
I I I I I I I I -J I I I ' i ■ 

GGGGTTTCCTCTGGAGCTTCCTGAGTTTCCTCCTGGACATTTTGTCTCCAGGTCCCAGCGCCAGGCAGGTC 

1 1 1 1 1 1 1 1 1 1 1 I I I L 150 



G P P L 
I l_ 



LPEFPP6HFVSRSQRQAGYA 
I I I I I I I I I ' I ■ 



TCCCGGAAGGGCTGTGGGTGCCACCCTGGCTGACTGCAGCCCTCTCTTGCACCTCCTCCCGGCCATCCACCCGCA 
1 1 1 1 1 1 1 1 I I I I I I I 225 

PGRAVGATLADCSPLLHLLPAIHPQ 

I I I I I L I I I \ I I I 

GGAGGTCTTCCCCCAGCACTGGCTTGTGAGGAGCTCCCTCTGCCCGGGAGAAAATGGCTCCTCCGGGTCACAGGC 

1 I I I I I I I I I I I I I L 300 

EVFPQHWLVRSSLCPGENGSSGSQA 
I I I I I I I I I I II- 

TCCCCTCCAGGGACTGAGGGGCATTTTTGGATTGTGGGGAAGGCGCTCCAGGGCCCGGTTCTGTGGCCCCAGGCC 
1 1 1 1 1 1 I _J I I I I I I I 375 



PLQGIiRGI 

I I I I L. 



GLWGRRSRARFCGPRP 
_l I I I I I I I I I. 



TGTTGCTCGGCTGGGTGGAGGCACCTCTGCAGGCCGGGAGCTTGGTCTTTGAACACCT 
1 1 , 1 , J , 1 , 1 1^ ^ 



VARLGGGTSAGRELGL TP 
1 1 I 1 I I I I I I * ^ 
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FIG. 49 



AATTOSCGGCCGCGTCGACGGGGATGTTTACAACCCCAGCACaSGGGTCTTCACGGCTCCTTATGATGGGCGCTA SEQ ID NO: 114 
I I I I I I I I _l I I I I I I L 75 



adaptor 



IRGRVDGDVYNPSTGVFTAPYDGRY SEQ ID NO: 274 

I I I I I I I I _J i I I I I I L 

CCTGATCACGGCCACCCTCACCCCCXSAGAGAGACGCCTACGTGGAAGCAGTGCTGTCGGTCTCCAACGCCAGCAG 
I I I \ I I I I I I I I I L 150 

LITATLTPERDAYVEAVLSVSNAS S 
I I I I I I 1 I I I I I I I I. 

TGGCCCAGCTGCATACCGCTGGGTACAGGAGAGAGTTCCTGGAATACCACCGCCCTCCAGGAGCTTTGCATACCT 
I I I I I I I I I I I I 1 1 1. 225 

GPAAYRWVWERVPGI PPPSRSFAYL 
« ^ * I « I » I ' * « * ' ' ' 

GCGGGGGCCCGGGGGCATTCCACCTCATOSTGCACCTGAAGGCGGGAGATGCAGTCAACGTCGTGGTGA 

I I I I I I I I I I I I I I I- 300 

RGPGGI PPHRAPEGGRCSQRRGDWG 
1 . I . I . I . ( ■ I . I « 

GCAAGCTGGCTCACACAGACTTTGATGAAATGTACTCCACATTTAGTGGGGTTTTCTTATATCCTTTCCTTTCCC 

I I I I I i I I I I I I I I L 375 



Q A G S H R 
I I I L 



L N V L H I 



GFLISFPFP 



ACCTCTAAGGTGGCTGGGGAGATGT 
1 1 —I 1 ^ 400 

PLRWLGRC 
I i I I ► 
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FIG. 50 



AATTCGCGGCCGCGTCGACCGCAACTGTCAAGACATTGATGAGTGTGTGACTGGCATCCACAACTGCTCCATCAA SEQ ID NO: 115 
I . I I I • I 1 1 1 1 1 1 1 1 1. 75 



adaptor 



IRGRVDRNCQDIDECVTG IHNCSIN SEQ ID NO: 275 

I « I I I I I . -J I I . 1 1 1 L 



CGAGACCTGCTTCAACATCCAGGGCGGCTTCCGCTGCCTGGCCTTCGAGTGCCCTGAGAACTACCGCCGCTCCGC 



150 



ETCFNIQGGFRCLAFECPENYRRSA 
I I I I I I I I I 1 1 1 1 1 1 

AGCCACGCTCCAGCAGGAGAAGACAGACACGGTCCGCTGCATCAAGTCCTGCCGCCCCAACGATGTCACATGCGT 

i ■ i ■ ' ■ I I I I I I I L 225 

ATLQQEKTDTVRCI KSCRPNDVTCV 

> ■ I ■ I ■ I ■ I 1 I I I L 

GTTCGACCCCGTGCACACCATCTCCCACACCGTCATCTCGCTGCCTACCTTCCGCGAGTTCACCCGCCCTGAAGA 

» ■ i ' I - I I I I I I I I L 300 

FDPVHTISHTVISLPTFREFTRPEE 

I I I I I I I I I I 1 1 1 \ L 

GATCATCTTCCTCCGGGCCATCACGCCACCGCATCCTGCCAGCCAGGCTAACATCATCTTCGACATCACGGAAGG 

i I I I I I I [ I I I I L 375 

II FLRAITPPHPASQANIIFDITEG 

I ■ I ' I I I I I I I I I L 

GAACCTGCGGGACTCTTTTGACATCATCAAGCGTTACATGGACGGCATGACCGTGGGTGTCGTGCGCCAGGTGCG 

I ■ I . I ■ I ■ I « I I I I 450 

N L R, D S F ^D I I ^ ^ ,^ D G M V G Y V ,R ^1 ^ ? 

GCCCATCGTGGGCCCATTTCATGCCGTCCTGAAGCTGGAGATGAACTATGTGGTCG 

I I I I I -J I I I 1 506 



P I V. G P F.HAVL KL EMNY.VV 

ill I ■ I I 



J I L 
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FIG. 51 



AATTCGCGGCCGCGTCGACACACTGCAGAGTAATGCTCCATCAAGTATGATGGTGAAGGATGAATATGTGCATGA SEQ ID NO: 116 



J- 



j_ 



J L- 



JL 



75 



adaptor 



IRGRVDTLQSNAPSSMMVKD 
I I I I I I I I I I I I l_ 



Y V H D SEQ ID NO: 276 
I I L 



CTTTGAGGGACTVGCCATaSTTGTCCACrGAAGGACa^TTCAATTCAAACCATCCAGCATCCACCAAGTAATCGTGC 
I I I I I I I I I I I I 1 I L 150 

FEGQPSLSTEGHSIQTIQHPPSNRA 

I I I \ I \ I \ I I I I I \ L 



ATCGACAGAGACATACAGCACCCCAGCTCTGTTAGCCCCATCTGAGTCTAATGCTACCAGCACTGCCAACTTTCC 



225 



STBTYSTPALLAPSESNAT 
I [ I I I I I I I I I 



T A N F P 
I I L 



CAACT^TTCCTGTGGCTTCCACAAGTCAGCCTGCCAGTATACTGGGGGGCAGCCATAGTGAAGGACTGTTGCAGAT 

■ i I I 1 I I I I I I I I I L 300 



NIPVASTSQPASILGGSH 

I I I I I I I I I I L. 



G L L Q I 



AGCATCAGGGCCTCAGCCAGGACAGCAGCAGAATGGATTTACTGGTCAGCCAGCTACTTACCATCATAACAGCAC 
I I I I I I I I I I I I I I L 375 



ASGPQPGQQ 



GFTGQPATYHHNST 



TACCACCTGGACTGGAAGTAGGACTGCACCATACACACCTAATTTGCCTCACCACCAAAA 
1 1 I 1 1 1 1 I I 1 1 ^ 435 



TTWTGSRTAPYTPNLPHHQK 
I I I I ' ■ ' I I I ► 
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FIG. 52 



AATTCGCGGCCGCGTCGACCGGCGGCCGAGGAGCGGCGGACTCCGGGCGCGGGGAGTCGAGGCATTTGCGCCTGG SEQ ID NO: 117 

1 1 f i f ^ f H ^ f i 1 1 F ^ 75 

TTAAGCGCCGGCGCAGCTGGCCGCCGGCTCCTCGCCGCCTGAGGCCCGCGCCCCTCAGCTCCGTAAACGCGGACC SEQ ID NO: 118 



adaptor 



IRGRVDRRPRSGGIiRARGVEAFAPG SEQ ID NO: 277 

I I I I I I I I _J I I I I I I L 



GCTTCGGAGj:GTAGCGCCAyGGCCTGAGC<j:TTTGjU^GCA|GAGGAGGGG^^ 

CGAAGCCTCGCATCGCGGTCCCGGACTCGGAAACTTCGTCCTCCTCCCCTCCTCTCTCACCCCGAGGAGATAGCC 



LRSVAPGPEPLKQEEGRREWGSSIG 
t * \ * I » I t I I I I I I 



GACCCCCTCCCCATGTGGATCTGCCCAGGCGGOSGCGGCGGAGGAGGCGACCGAGAAGATGCCCGCCCTGCGCCC 
1 f 1 f 1 f 1— — f 1 -1 \ \ 1 f h 225 

CTGGGGGAGGGGTACACCTAGACGGGTCCGCCGCCGCCGCCTCCTCCGCTGGCTCTTCTACGGGCGGGACGCGGG 



TPSPCGSAQAAAAEEATEKMPALRP 

I I I I l__ I I I I I I I I I L 



CGCTCTGCTGTGGGCGCTGCTGGCGCTCTGGCTGTGCTGCGCGACCCCCGCGCATGCATTGCAGTGTCGAGATGG 

1 f 1 f 1 f 1 I 1 f \ 1 1 1 h 300 

GCGAGACGACACCCGCGACGACCGCGAGACCGACACGACGCGCTGGGGGCGCGTACGTAACGTCACAGCTCTACC 



ALLWAIiLALWLCCATPAHAIiQCRDG 



CTATGAACCCTGTGTAAATGAAGGAATGTGTGTTACCTACCACT^TGGCACAGGATACTGCAAATGTCCM 

1 f 1 f 1 + 1— — f 1 1 1 f 1 1 h 

GATACTTGGGACACATTTACTTCCTTACACACAATGGATGGTGTTACCGTGTCCTATGACGTTTACAGGTNTTCC 



375 



YEPCVNEGMCV 
I I I I I I t 



TYHNGTGYC 



■K G P^ ? G 



CTTCTTGGGGGAATATTGTCAACATCGANACCCCTGTGAGAAGAACCGCTGC 

1 1 1 1 \ 1 1 1 1 427 

GAAGAACCCCCrTATAAO^GTTGTAGCTNTGGGGACACTCTTCTTGGCGACG 



FLGEYCQHR? PCEKNRC 
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FIG. 53 



Pathogenicity 
Island probe # 
(bp#) 


P. aeruginosa strains that gave 
Positive signal with the specific 
probe used 


P. aeruginosa strains that gave 
Negative signal with the specific 
probe used 


3 (25562-26456) 


PA14, CF2. CF6, CF26, CF29 


PA01.PAK.CF1.CF3, 
CF4, CF5. CF27, CF28, 
CF30, CF32 


4(61181-63607) 


PA14, CF2, CF6. CF26. CF29 


PA01.PAK, CF1,CF3, 
CF4, CF5, CF27, CF28. 
CF30, CF32 


5(74933-76117) 


PA14, PA037. CF2, CF6. CF26 


PA01.PAK.CF1,CF3. 
CF4. CF5, CF27, CF28. 
CF29, CF30. CF32 


6 (84922-86622) 


PA01. PA14. PA037. CF2, CF6, 
CF26 


PAK, CF1, CF3, CF4. CF5, 
CF27, CF28, CF29. CF30, 
CF32 


7 (103070-104556) 


PA14, PA037. CF2, CF6. CF26 


PA01,PAK,CF1,CF3, 
CF4. CF5. CF27, CF28. 
CF29, CF30, CF32 


8(104799-105545) 


PA14. CF2, CF6, CF26 


PA01.PAK, CFI.CFS, 
CF4, CF5, CF27. CF28, 
CF29, CF30, CF32 



These experiments indicate that at least part of the big island region contained in each probe is 
present in the P. aeruginosa strains that gave positive hybridization signal. 
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FIG. 54 



ORF7 Protein SEQ ID NO: 278 

MINSHLLYRLSYRGTSFFQPWTLPVLLDSRLRGAPFYGCARACQPSDPKSFSSFSTSDKTALPL 
HAAALSRLPDAHEKAPPKRGFPCPPPKRSGEDDLVAFHLRRDTGTRREFAGQDQLRQRVLDPAL 
DGPLQRACAIDRVEADGNQLVQRLLAQFQAQLALGQALAQATELDLGDAGDLLASQRLEHHHFV 
DPVDEFRTEVRIDRVHHCGTLRLAVAGQLLDLRRTEVGGHHHHGVAEVHRTPVTVGQASVLEHL 
EENVEYIRMGLLHLVQQHHRVGLAADRLGQVAAFLEADVARRRADQAGHRVFLHELGHIYPHQR 
LLGIEEELGQRLAQLGLAHPGRAEEEERAARPVRIGEAGARTAHGVGHGDYRLVLADHSPMQLL 
LPIAQQLLALALEHLRHRDTGPLGNHFGDFLVGHLVAQQLVLGLAVLVDHLQAAFQVRDGLVLDA 
RHALEVALAPRRLHLLLGLLDLLLDLRRALHLGLLGLPDLLEVGVFALELDDILLQLGQALPGG 
F WFLLQRLALDLQLDQATVET I QFLRLGVDLHADAAGGL VDQVDGL VRQL P I GD VAVRQLGRG 
DDRAVGDAHPWHFIAFLEATEDGDGVFLARFVHQHLLEAALQRGILLDVLAILVEGSSTDAVQ 
LAARQSRLEHVAGVHGTFRIiAGADHGVQFVDEQDDPAFLLAQFVEDRLQAFLELAAELGTGDQR 
PHVQGQQALVLEAVRHFAVDDALGQALDDGGLADAGFADQHRWLGPPLQDLDGPADLWATDH 
RVELAFLGALGHVDGVL VQRLARLLDVRWHRFAATQ VGHG I LQRLARHALAEQQLAE PGVLVH 
RGQQYQLAGDELVALLLGQAVSLVEQACE I LGQVHVAGRALDLRQRVEFFVEAAAQGGD I EADL 
HQQGLDRTALLLEQGGKQVHRLDGRlVrVMANGQGLGVGERQLQLAGQTVYSHGSSFLL. 
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FIG. 55 

0RF7 nucleotide sequence SEQ ID NO: 119 

ATGATTAACAGTCATTTGCTCTACCACTGAGCTATCGCGGAACGTCTTTCTTCCAACCCTGGAC 
GCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTACGGATGCGCGCGGGCATGT 
CAACCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACTGC 
ATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCC 
CTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACG 
GGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAG 
TTCCAAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGAT 
GCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGATGAAT 
TCCGGACGGAAGTGCGCATTGACCGCGTCCATCACTGCGGCACGTTGCGCCTCGCGGTCGCCGG 
CCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTC 
CACCGTACGCCCGTGACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAAT 
ACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCACCGAGTAGGGCTTGCGGCGGATCG 
CCTCGGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCAGGCGGGC 
CACCGAGTGTTTCTCCATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAG 
AGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGA 
ACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACAC 
GGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGC 
TTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGG 
CGATTTCCTCGTCGGTCACCTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACC 
ATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGT 
CGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGC 
CGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCG 
AGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCTTCT 
TCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGG 
CTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCC 
GGCAGTTGCCGATCGGTGATGTAGCGGTGCGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGG 
TGATGCTCACCCGTGGTGCACTTCATAGCGTTCCTTGAGGCCACGGAGGATGGCGATGGTGTCT 
TCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCATCCTTCTCGA 
TGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGC 
CGGCTTGAGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGC 
AGTTCGTCGATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCA 
GGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAG 
CAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCCAGGCCCTCG 
ACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCT 
GCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTC 
CTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTC 
GGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACA 
CGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAGCAATACCAG 
CTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGT 
GCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTT 
CTTTGTTGAGGCCGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATCAGCAGGGGCTTGAT 
CGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCA 
TGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTA 
TTCGCATGGGTCGTCCTTCCTTCTATAG 
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FIG. 56 

clpB protein SEQ ID NO: 279 

MRIDRLTSKLQLALSDAQSLAVGHDHPAIEPVHLLSALLEQQGGSIKPLLMQVGFDIAALRSGL 

NKELDAL PKI QS PTGDVNLSQDLARLLNQADRLAQQKGDQF I S SELVLLAAMDENTRLGKLLLG 

QGVSRKALENAVANLRGGEAVNDPNVEESRQALDKYTVDMTKRAEEGKLDPVIGRDDEIRRTIQ 

VLQRRTKNNPVLIGEPGVGKTAI VEGLAQRI INGEVPDGLKDKRLLALDMGAL I AGAKFRGEFE 

ERLKAVLNELGKQEGRVI L F I DELHTMVGAGKAEGAMDAGNMLKPALARGELHCVGATTLDEYR 

QYIEKDAALERRFQKVLVDEPSEEDTIAILRGLKERYEVHHGVSITDGAIIAAAKLSHRYITDR 

QLPDKAIDLIDEAASRIRMEIDSKPEELDRLDRRLIQLKIEREALKKEDDEATRKRLAKLEEDI 

VKLEREYADLEEIWKSEKAEVQGSAQIQQKIEQAKQEMEAARRKGDLESMARIQYQTIPDLERS 

LQMVDQHGKTENQLLRNKVTDEEIAEWSKWTGIPVSKMLEGEREKLLRMEQELHRRVIGQDEA 

WAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVGKTELCKALAEFLFDTEEALVRIDMSEFMEK 

HSVARLIGAPPGYVGFEEGGYLTEAIRRKPYSWLLDEVEKAHPDVFNILLQVLEDGRLTDSHG 

RTVDFRNTVVVMTSNLGSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEVWFEPIA 

I AGI AE I QLGRLRKRLAERELSLELSQEALDKL I AVGFDPVYGARPLKRAI QRWI ENPLAQLI L 

AGKFAPGAS I SAKVEGDE I VFA . 
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FIG. 57 



clpB DNA SEQ ID NO: 120 

ATGCGAATAGACCGTTTGACCAGCAAGCTGCAACTGGCGCTCTCCGACGCCCAGTCCCTGGCC 
GTTGGCCATGACCATCCGGCCATCGAGCCGGTGCACCTGCTTTCCGCCCTGCTCGAGCAGCAA 
GGCGGTTCGATCAAGCCCCTGCTGATGCAGGTCGGCTTCGATATCGCCGCCCTGCGCAGCGGC 
CTCAACAAAGAACTCGACGCGCTGCCGAAGATCCAGAGCCCGACCGGCGACGTGAACCTGTCC 
CAGGATCTCGCACGCCTGCTCAACCAGGCTGACCGCCTGGCCCAGCAGAAGGGCGACCAGTTC 
ATCTCCAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGCTCGGCAAGCTGCTG 
CTCGGCCAGGGCGTGTCGCGCAAGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAA 
GCGGTGAACGACCCGAACGTCGAGGAGTCGCGCCAGGCGCTGGACAAGTACACCGTCGACATG 
ACCAAGCGCGCCGAGGAAGGCAAGCTCGACCCGGTGATCGGTCGCGACGACGAGATCCGCCGG 
ACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACCCGGTGCTGATCGGCGAACCCGGCGTC 
GGCAAGACCGCCATCGTCGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTGCCGGACGGC 
CTCAAGGACAAGCGCCTGCTGGCCCTGGACATGGGGGCGCTGATCGCCGGTGCCAAGTTCCGC 
GGCGAGTTCGAGGAACGCCTGAAGGCGGTCCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTC 
ATCCTGTTCATCGACGAACTGCACACCATGGTCGGCGCCGGCAAGGCGGAAGGTGCCATGGAC 
GCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGCTACTACC 
CTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAAGGTG 
CTGGTGGACGAACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTAT 
GAAGTGCACCACGGGGTGAGCATCACCGACGGCGCGATCATCGCCGCGGCCAAGCTGTCGCAC 
CGCTACATCACCGATCGGCAACTGCCGGACAAGGCCATCGACCTGATCGACGAGGCCGCCAGC 
CGCATCCGCATGGAGATCGACTCCAAGCCGGAGGAACTGGATCGTCTCGACCGTCGCCTGATC 
CAGCTGAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCACCAGGAAGCGCCTG 
GCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCCGACCTCGAGGAGATCTGG 
AAGTCCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAAG 
CAGGAGATGGAGGCGGCGCGGCGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAG 
ACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGACCAGCACGGCAAGACCGAGAACCAG 
TTGCTGCGCAACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGACCGGTATC 
CCGGTGTCGAAGATGCTCGAGGGCGAGCGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCAT 
CGGCGAGTGATCGGCCAGGACGAGGCGGTAGTCGCCGTGTCCAACGCCGTGCGCCGTTCGCGC 
GCCGGCCTCGCCGATCCGAACCGGCCGAGCGGCTCGTTCCTCTTCCTCGGCCCGACCGGGGTG 
GGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCGATACCGAGGAGGCGCTGGTG 
CGGATAGATATGTCCGAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCTCCG 
GGCTACGTCGGCTTCGAGGAAGGCGGCTACCTGACCGAGGCGATCCGCCGCAAGCCCTACTCG 
GTGGTGCTGCTGGACGAGGTGGAGAAGGCCCATCCGGATGTATTCAACATTCTCCTCCAGGTG 
CTCGAGGACGGACGCCTGACCGACAGTCACGGGCGTACGGTGGACTTCCGCAACACCGTGGTG 
GTGATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGACCGCGAGGCGCAA 
CGTGCCGCAGTGATGGACGCGGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGAC 
GAAGTGGTGGTGTTCGAGCCGCTGGCTCGCGAGCAGATCGCCGGCATCGCCGAGATCCAGCTC 
GGTCGCCTGCGCAAGCGCCTGGCCGAGCGCGAGCTGAGCCTGGAACTGAGCCAGGAGGCGCTG 
GACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGCGCACGCCCGCTGAAGCGGGCCATC 
CAGCGCTGGATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCCGGGTGCC 
AGTATCTCGGCGAAGGTGGAAGGCGACGAGATCGTCTTCGCCTGA 
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FIG. 60A 
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FIG. 61 
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FIG. 63 
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X. axonopodis pv. dtri 306 / AAM37094 


RL068 


56 


2.6 X 10' 


no significant similarity 


RL090 


67 


2.7 X 10< 


no significant similarity 


RL092 


0 


1.3x10' 


topoisomerase i TopA, X. fastidiosa 9a5c (plasmid pXF51) / 
AAF85572 


RL095 


50 


5.3x105 


singie-stranded DMA binding protein Ssb, P. aenjginosa C / 
AAN62318 


RL101 


38 


1.8x106 


Pseudomonas sp. B13 / CAD60668 


RL112 


38 


1.6 X 10* 


no significant similarity 


RS06 


100 


1.8 X 10' 


P. aenjginosa PA01 / AAG04369 
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FIG. 67 

ORF 7 (SEQ ID NO: 280) 

LEFGSATWTRTRDPMINSHLLYRLSYRGTSFFQPWTLPVLLDSRLRGA PFYGC7U^\CQPSDPKSFSSFSTSDKTAL 

PLHAAALSRLPDAHEKAPPKRGFPCPPPKRSGEDDLVAFHLRRDTGTRREFAGQDQLRQRVLDPALDGPLQRACAI 

DRVEADGNQLVQRLLAQFQAQLiALGQALAQATELDLGDAGDLLASQRLEHHHFVDPVDEFRTEVRIDRVHHCGTLR 

IJ\VAGQLLDLRRTEVGGHHHHGVAEVHRTPVTVGQASVLEHLEENVEYIRMGLLHLVQQHHRVGLAADRLGQV^ 

LEADVARRRADQAGHRVFLHELGHIYPHQRLLGIEEELGQRLAQLGLAHPGRAEEEERAARPVRIGEAGARTAHGV 

GHGDYRLVLADHSPMQLLLHAQQLIiAIJ^LEHLRHRDTGPLGNHFGDFLVGHLVAQQLVLGLAVLVDHLQAAFQVRD 

GLVLDARHALEVALAPRRLHLLLGLLDLLLDLRRALHLGIiLGLPDLLEVGVFALELDDILLQLGQALPGGFWFLL 

QRLALDLQLDQATVETIQFLRLGVDLHADAAGGLVDQVDGLVRQLPIGDVAVRQLGRGDDRAVGDAHPVVHFIAFL 

EATEDGDGVFLARFVHQHLLETU^LQRGILLDVIiAILVEGSSTDAVQLAARQSRLEHVAGVHGTFRLAGADHGVQFV 

DEQDDPAFLI^QFVEDRLQAFLELAAELGTGDQRPHVQGQQALVLEAVRHFAVDDALGQALDDGGLADAGFADQHR 

WLGPPLQDLDGPADLWATDHRVELAFLGALGHVDGVLVQRLARLLDVRVVHRFAATQVGHGILQRLARHALAEQ 

QLAEPGVLVHRGQQYQLAGDELVALLLGQAVSLVEQACEILGQVHVAGRALDLRQRVEFFVEAAAQGGDIEADLHQ 

QGLDRT ALLLEQGGKQVHRLDGRMVMANGQGLGVGERQLQLAGQTVY SHGS S FLL 



ORF7 (SEQ ID NO: 281) 

TTGGAATTTGGCTCCGCGACCTGGACTCGAACCAGGGACCCA ATGATTAACAGTCATTTGCTCTACCGACTGAGCT 
ATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTA 
CGGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTT 
CCACTGCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTC 
CGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGTU^TT 
TGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATA 
GACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCC 
AGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCA 
CCACTTCGTCGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATTGACCGCGTCCATCACTGCGGCACGTTGCGC 
CTCGCGGTCGCCGGCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAG 
TCCACCGTACGCCCGTGACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAATACATCCGGAT 
GGGCCTTCTCCACCTCGTCCAGCAGCACCACCGAGTAGGGCTTGCGGCGGATCGCCTCGGTCAGGTAGCCGCCTTC 
CTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATATCT 
ATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGG 
TCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCACGGCGTT 
GGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCTCG 
CGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCA 
CCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGAT 
GGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGC 
TCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGT 
ATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCTTCTT 
CAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGTCGATC 
TCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAGC 
GGTGCGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTT 
GAGGCCACGGAGGATGGCGATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGC 
GCGGCATCCTTCTCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCA 
GAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTC 
GATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCG 
CGGAACTTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTT 
CGCCGTTGATGATGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGG 
GTTGTTCTTGGTCCGCCGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGC 
TTGCCTTCCTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGT 
CGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAG 
CAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTC 
TGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCT 
GGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATCAG 
CAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCA 
TGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTATTCGCATGGGTC 
GTCCTTCCTTCTATAG 

>| 
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FIG. 68 

(SEQ ID NO: 121) 

ac gtcgggg gcgcattgctacgcctgc agaatgg tttcagggccttagaaacagaaaagccca 
cc^^^^ggcgggctattccatattga^s^^gtcaatgcgggcctaatgttcggcccaga 

cggctgctagacaagaaccggcgtaacaccccttcctagcctatgcaactcgcccccQagaaa 
atggtgggtcgtgtaggattcgaacctacgaccaattggttaaaagccaactgctctaccgac 
tgagctaacgaccca agtatgag gtggtcggggtagagagattcgaactcccgacatcctgct 
cccaaagcaggcgcgctaccggactgcgctataccccgattggaatttggctccgcgacctgg 
actcgaaccagggacccaatg 
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FIG. 69 
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FIG. 70A 

CLUSTAL W (1.82) multiple sequence alignment 

CF2 (SEQ ID NO: 122) GATGAAGG-ACCCGAGCGGAACATCCATCTCAC 32 

PA037(SEQ ID NO: 123) GATGAAGG-ACCCGAGCGGAACATCCATCTCAC 32 

CF6 (SEQ ID NO: 124) GATGAAGGCACCCGAGCGGAACATCCACCTCAC 33 

PA14 (SEQ ID NO: 125) TTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTCACCTCAC 60 

CF26 (SEQ ID NO: 126) GATGAAGG-ACCCGAGCGGAACATCCACCTCAC 32 



CF2 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCGAGGCCGACTA 92 

PA037 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCGAGGCCGACTA 92 

CF6 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCGAGGCCGACTA 93 

PA14 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTA 120 

CF26 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTA 92 



CF2 CGACCACATGCGTGCCXSTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 152 

PA037 CGACCACATGCGTGCCGTGCrrGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 152 

CF 6 CGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 153 

PA14 CGATCACATGCGTGCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGT 180 

CF26 CGATCACATGCGTGCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGT 152 

*** *********************** ***************** *********** ** 

CF2 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 212 

PA037 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 212 

CF6 CAGGAAGGTGG ATTACAGCGGCTCCCCGCAGGCCTCTGCCAAACCGAAGCCATGCGGGC A 213 

PA14 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 24 0 

CF26 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAACCATGCGG6CA 212 



CF2 ACCTCTCGCTCGAATGGGTGAAGTACCGCCGCCGGAGGCCAGGACAGAACCTGCAGAACC 272 

PA037 ACCTCTCGCTCGAATGGGTGAAGTACCGCCGCCGGAGGCCAGGACAGAACCTGCAGAACC 272 

CF6 ACCTCTCGCTCGAATGGGTGAAGTACCGCCGCCGGAGGCCAGGACAGAACCTGCAGAACC 273 

PA14 ACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTGCAGAACC 300 

CF2 6 ACCTCTCGCTCGAGTGAGTGAAGOGCCGCCACCGGAAGCCAGGACAGAGCCTGCAGAACC 272 

************* ** ****** ***** ***** *********** *********** 

CF2 AAGGGAGCACATACCAGGGACAGGCATTCAGGGAGGTCCAACCGTGCGGGAAGAAG C 329 

PA03 7 AAGGGAGCACATACCAGGGACAGGCATTCAGGGAGGTCCAACCGTGCGGGAAGAAG C 3 29 

CF 6 AAGGGAGCACACACCAGGGACAGGCATTCAGGGAGGTCCAACCGTGCGGGAAGAAG C 330 

PA14 AAGGGAGCACACGCCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGC 360 

CF2 6 AAGGGAGCACACACCAGGGACAGGCATTCAGGGAGGTCCAACCGAGTGGGAAGCAGAAGC 332 

*********** ******************************* * ****** ** * 

CF2 GCTACCACAGCCACCTGACACCTTCGACCA7VAGCGTGCTGTTCACTCTGATGGCTCAACA 3 89 

PA03 7 GCTACCACAGCCACCTGACACCTTCGACCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 3 89 

CF6 GCTACCACAGCCACCTGACACCTTCGATCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 390 

PA14 GCTACCACAGCCACCTGACACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAA 420 

CF26 GCTACCACAGCCACCTGACACCTTCGAGCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 392 

*************************** * **************************** * 

CF2 CTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGATCTGGTGG 449 

PA037 CTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGATCTGGTGG 449 

CF6 CTTATCGAACCGTGCCGCCTCGGCATCCGCAGCTCCCX3CTGC;U^CAAGCGGATCTGGTGG 450 

PA14 CTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGATCAGGCGG 480 > 

CF2 6 CTTATCGAACCGTGCCGCCACGACATCTGCGGCTCCCGCCX3CAACCAGCGGATCTTGTAG 452 

******************* ** **** ** ******** ***** ******** * * 

CF2 ATGGGGATCTACTGCCCGAAGCAGTCTCGCCTAGCGATACCGATACTGAAGGGCCGGCTA 50 9 

PA03 7 ATGGGGATCTACTGCCCGAAGCAGTCTCGCCTAGCGATACCGATACTGAAGGGCCGGCTA 50 9 

CF6 ATGGGGATCTACCGCCCGAAGCAGTCTCGCCTAGCGATACCGGTACTGAAGGGCCGGCTA 510 

PA14 ATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAGCGATACCGGTACTGA-GGGCCGGCTA 53 9 

CF2 6 ATGGGGATCTGCCGCCCGAAGCAGCCTCGCCTAGCGATACCGGTACTGAGGGGCCGGCTA 512 

********** * ********** ***************** ****** ********** 
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FIG. 70B 



CF2 CCGGACGAAAGGTAGCCGCGCCTCCCAGCAGTTCGCTAGGCCTGTAAGAAAAATCTGGAA 569 

PA037 CCGGACGAAAGGTAGCCGCGCCTCCCAGCAGTTCGCTAGGCCTGTAAGAAAAATCTGGAA 569 

CF6 CCGGACGAAAGGTAGCCGCGCCTCCCAGCAGTTCGCTAGGCCTGTAGGAAAAATCTGGAA 570 

PA14 CCGGACGAAAGGTAGCCGTGCCTTCCAGCAGATCGTTAGGCCTGTAGGAAAAATCTGGAA 599 

CF2 6 CCAGACGAAAGGTAGCCGCGCCTCCCAGCAGATCGCTGGGCCTGTAGGAAAAATCTGGAA 572 

*★ *************** **** ******* *** * ******** ************* 

CF2 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCC - CGCAGTTTCACGG 62 8 

PA03 7 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCC - CGCAGTTTCACGG 628 

CF6 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGGGCCC - CGCAATTTCAAGG 629 

PA14 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCAGCGCAATTTCAAGG 659 

CF2 6 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCC - CGCAATTTCACGG 631 

* *************************************** * *** **** ***** ** 

CF2 CCAAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 688 

PA03 7 CCAAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 688 

CF6 C-GAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGAGGGCACATTGCTACGCCTGC 688 

PA14 CCAATACCACAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 719 

CF26 C-AAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCACATTGCTACGCCTGC 690 

* * *** ****************************** **** *************** 

CF2 AGAA - TGGTTTCAGGGCCTTANAAACAGAAAAGCCCACCTTAAATAGGCGGGCTATT - CC 74 6 

PA03 7 AGAAATGGTTTCAGGGCCTTAGAAACAGAAAAGCCCACCCTAAATAGGCGGGCTATT - CC 74 7 

CF6 AGAA - TGGTTTCAGAGCCT - GAAAACAGAAAAGNCCACC - TAAATAGGCGGGCTATTTCC 74 5 

PA14 AGAA-TGGTTTCAGGGCCTTAGAAACAGAAAAGCCCACC-TAGAAAGGCGGGCTATT-CC 776 

CF2 6 AGAA - TGGTTTCAGAGCCTTANAAACAGAAAAGCCCACC - TAGATAGGCGGGCTATT - CC 747 

*★** ********* **** *********** ***** ** * ************ ** 

CF2 ATATT - GACATCACG - TCAATGCGGG - - CCTAATGTTC - - GGCCCAGACGGCTG - - CTGG 798 

PA03 7 ATATT - GACATCACG - TCAATGCGGG - - CCTAATGTTC - - GGCCCANACGGCTG - - CTGG 799 

CF6 ATATTTGACATCCCG- TCAATGCGGGGCCCTAATGGTTCGGGCCCANACGGCTTGCTTGG 804 

PA14 ATATT -GACATCACG -TCAATGCGGG --CCTAATGTTC-- GGCCCAGACGGCTG --CTAG 828 

CF26 ATATT -GACATCACGGTCAATGCGGG--GCTAATGTTC-GGGCCCANACGGNTG CAA 800 

***** ****** ** ********** ****** * ****** **** * 
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